首页 > 最新文献

Journal of Multivariate Analysis最新文献

英文 中文
Density and graph estimation with smoothing splines and conditional Gaussian graphical models 平滑样条和条件高斯图模型的密度和图估计
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2026-03-01 Epub Date: 2025-11-14 DOI: 10.1016/j.jmva.2025.105543
Runfei Luo , Anna Liu , Hao Dong , Yuedong Wang
Density estimation and graphical models play important roles in statistical learning. The estimated density can be used to construct a graphical model that reveals conditional relationships, whereas a graphical structure can be used to build models for density estimation. We propose a semiparametric framework that models part of the density function nonparametrically using a smoothing spline ANOVA (SS ANOVA) model and the conditional density parametrically using a conditional Gaussian graphical model (cGGM). This flexible framework allows us to deal with high-dimensional data without the Gaussian assumption. We develop computationally efficient algorithms for estimation and provide theoretical guarantees for our procedure. Our experimental results show that the proposed framework outperforms both parametric and nonparametric baselines.
密度估计和图形模型在统计学习中起着重要的作用。估计的密度可用于构建显示条件关系的图形模型,而图形结构可用于构建密度估计模型。我们提出了一个半参数框架,使用平滑样条方差分析(SS ANOVA)模型非参数地建模部分密度函数,使用条件高斯图形模型(cGGM)参数地建模部分条件密度函数。这个灵活的框架允许我们在没有高斯假设的情况下处理高维数据。我们开发了计算效率高的估计算法,并为我们的程序提供了理论保证。我们的实验结果表明,所提出的框架优于参数基线和非参数基线。
{"title":"Density and graph estimation with smoothing splines and conditional Gaussian graphical models","authors":"Runfei Luo ,&nbsp;Anna Liu ,&nbsp;Hao Dong ,&nbsp;Yuedong Wang","doi":"10.1016/j.jmva.2025.105543","DOIUrl":"10.1016/j.jmva.2025.105543","url":null,"abstract":"<div><div>Density estimation and graphical models play important roles in statistical learning. The estimated density can be used to construct a graphical model that reveals conditional relationships, whereas a graphical structure can be used to build models for density estimation. We propose a semiparametric framework that models part of the density function nonparametrically using a smoothing spline ANOVA (SS ANOVA) model and the conditional density parametrically using a conditional Gaussian graphical model (cGGM). This flexible framework allows us to deal with high-dimensional data without the Gaussian assumption. We develop computationally efficient algorithms for estimation and provide theoretical guarantees for our procedure. Our experimental results show that the proposed framework outperforms both parametric and nonparametric baselines.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105543"},"PeriodicalIF":1.4,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145537555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stochastic arrangement increasing property of skew-normal distributions 偏正态分布的随机排列递增特性
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2026-03-01 Epub Date: 2025-11-17 DOI: 10.1016/j.jmva.2025.105544
Jiajie Lu, Xiaohu Li
In this study, we investigate both sufficient and necessary conditions for bivariate skew-normal distributions to be stochastic arrangement increasing. The main results serve as either natural extension of or nice supplement to the characterization result of this property for bivariate normal distributions due to Cai and Wei (2015). Also, we generalize these results to multivariate skew-normal distributions. Numerical examples based on the theory and a real data are presented to illustrate the main results as well.
本文研究了二元偏正态分布是随机排列递增的充要条件。主要结果是Cai和Wei(2015)对二元正态分布的这一性质的表征结果的自然扩展或很好的补充。此外,我们将这些结果推广到多元偏正态分布。最后给出了基于理论和实际数据的数值算例来说明主要结果。
{"title":"Stochastic arrangement increasing property of skew-normal distributions","authors":"Jiajie Lu,&nbsp;Xiaohu Li","doi":"10.1016/j.jmva.2025.105544","DOIUrl":"10.1016/j.jmva.2025.105544","url":null,"abstract":"<div><div>In this study, we investigate both sufficient and necessary conditions for bivariate skew-normal distributions to be stochastic arrangement increasing. The main results serve as either natural extension of or nice supplement to the characterization result of this property for bivariate normal distributions due to Cai and Wei (2015). Also, we generalize these results to multivariate skew-normal distributions. Numerical examples based on the theory and a real data are presented to illustrate the main results as well.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105544"},"PeriodicalIF":1.4,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145537557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Varying-coefficient quantile regression with effect under panel data and missing observation 在面板数据和缺失观测下的变系数分位数回归
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2026-03-01 Epub Date: 2025-11-28 DOI: 10.1016/j.jmva.2025.105561
Shu-Yu Li , Han-Ying Liang , Bao-Hua Wang
We, in this paper, focus on partial linear varying-coefficient quantile regression with fixed effects under panel data and missing observations, where the missing observations include either responses or covariates are missing at random. Under independent setting, we define estimators of the unknown parameter vector, varying-coefficient function and effect in the model, and discuss their large number properties. To use the information from within-subject correlations, we propose weighted estimators for the unknown amounts, where the weights are chosen based on mean and quantile regressions, respectively, by quadratic inference technique and empirical likelihood method. Under dependent assumption, we establish asymptotic normality of the weighted estimators. Meanwhile, we study hypothesis tests of the parameter, varying-coefficient function and effect, and prove asymptotic distributions of restricted estimators and test statistics of the parameter under null hypothesis and local alternative hypothesis, respectively. Also, oracle property of the parameter is considered. Simulation study and real data analysis are conducted to evaluate the performance of the proposed methods.
本文主要研究面板数据和缺失观测值下固定效应的偏线性变系数分位数回归,其中缺失观测值包括随机缺失的响应或随机缺失的协变量。在独立设定下,定义了模型中未知参数向量、变系数函数和效应的估计量,并讨论了它们的大数性质。为了利用主体内相关性的信息,我们提出了未知数量的加权估计,其中权重分别基于均值和分位数回归,通过二次推理技术和经验似然方法选择。在相关假设下,我们建立了加权估计量的渐近正态性。同时,研究了参数、变系数函数和效应的假设检验,分别证明了零假设和局部备择假设下的约束估计量的渐近分布和参数的统计量检验。同时,还考虑了参数的oracle属性。通过仿真研究和实际数据分析,对所提方法的性能进行了评价。
{"title":"Varying-coefficient quantile regression with effect under panel data and missing observation","authors":"Shu-Yu Li ,&nbsp;Han-Ying Liang ,&nbsp;Bao-Hua Wang","doi":"10.1016/j.jmva.2025.105561","DOIUrl":"10.1016/j.jmva.2025.105561","url":null,"abstract":"<div><div>We, in this paper, focus on partial linear varying-coefficient quantile regression with fixed effects under panel data and missing observations, where the missing observations include either responses or covariates are missing at random. Under independent setting, we define estimators of the unknown parameter vector, varying-coefficient function and effect in the model, and discuss their large number properties. To use the information from within-subject correlations, we propose weighted estimators for the unknown amounts, where the weights are chosen based on mean and quantile regressions, respectively, by quadratic inference technique and empirical likelihood method. Under dependent assumption, we establish asymptotic normality of the weighted estimators. Meanwhile, we study hypothesis tests of the parameter, varying-coefficient function and effect, and prove asymptotic distributions of restricted estimators and test statistics of the parameter under null hypothesis and local alternative hypothesis, respectively. Also, oracle property of the parameter is considered. Simulation study and real data analysis are conducted to evaluate the performance of the proposed methods.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105561"},"PeriodicalIF":1.4,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145681897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fréchet kNN-based sufficient dimension reduction 基于knn的fresamet充分降维
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2026-03-01 Epub Date: 2025-11-28 DOI: 10.1016/j.jmva.2025.105566
Xueyan Huang, Rui Qiu, Zhou Yu
In this paper, we introduce two Fréchet inverse regression methods with kernel bandwidths determined by k nearest neighbors, designed to achieve sufficient dimension reduction for a metric space-valued response and Euclidean predictors. A key advantage of the proposals lies in their ability to effectively preserve the intrinsic information of the metric space-valued response. We establish the asymptotic normality of these methods through rigorous theoretical proofs. Additionally, simulations and a real data example are provided to validate the performance and practical applicability of the proposed methods.
在本文中,我们介绍了两种核带宽由k近邻决定的fracimet逆回归方法,旨在为度量空间值响应和欧几里得预测实现足够的降维。这些建议的一个关键优势在于它们能够有效地保留度量空间值响应的内在信息。通过严格的理论证明,建立了这些方法的渐近正态性。最后,通过仿真和一个实际数据算例验证了所提方法的性能和实用性。
{"title":"Fréchet kNN-based sufficient dimension reduction","authors":"Xueyan Huang,&nbsp;Rui Qiu,&nbsp;Zhou Yu","doi":"10.1016/j.jmva.2025.105566","DOIUrl":"10.1016/j.jmva.2025.105566","url":null,"abstract":"<div><div>In this paper, we introduce two Fréchet inverse regression methods with kernel bandwidths determined by <span><math><mi>k</mi></math></span> nearest neighbors, designed to achieve sufficient dimension reduction for a metric space-valued response and Euclidean predictors. A key advantage of the proposals lies in their ability to effectively preserve the intrinsic information of the metric space-valued response. We establish the asymptotic normality of these methods through rigorous theoretical proofs. Additionally, simulations and a real data example are provided to validate the performance and practical applicability of the proposed methods.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105566"},"PeriodicalIF":1.4,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145681887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Type I multivariate Pólya-Aeppli distributions with applications 带应用程序的I型多变量Pólya-Aeppli发行版
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2026-03-01 Epub Date: 2025-11-28 DOI: 10.1016/j.jmva.2025.105556
Claire Geldenhuys, Rene Ehlers, Andriette Bekker
An extensive body of literature exists that specifically addresses the univariate case of zero-inflated count models. In contrast, research pertaining to multivariate models is notably less developed. We propose two new parsimonious multivariate models that can be used to model correlated multivariate overdispersed count data. Furthermore, for different parameter settings and sample sizes, various simulations are performed. In conclusion, we demonstrate the performance of the newly proposed multivariate candidates on two benchmark datasets, which surpasses that of several alternative approaches.
有大量的文献专门论述了零膨胀计数模型的单变量情况。相比之下,与多变量模型有关的研究明显欠发达。我们提出了两个新的简化的多元模型,可以用来模拟相关的多元过分散计数数据。此外,对于不同的参数设置和样本量,进行了各种模拟。总之,我们证明了新提出的多变量候选算法在两个基准数据集上的性能优于几种替代方法。
{"title":"Type I multivariate Pólya-Aeppli distributions with applications","authors":"Claire Geldenhuys,&nbsp;Rene Ehlers,&nbsp;Andriette Bekker","doi":"10.1016/j.jmva.2025.105556","DOIUrl":"10.1016/j.jmva.2025.105556","url":null,"abstract":"<div><div>An extensive body of literature exists that specifically addresses the univariate case of zero-inflated count models. In contrast, research pertaining to multivariate models is notably less developed. We propose two new parsimonious multivariate models that can be used to model correlated multivariate overdispersed count data. Furthermore, for different parameter settings and sample sizes, various simulations are performed. In conclusion, we demonstrate the performance of the newly proposed multivariate candidates on two benchmark datasets, which surpasses that of several alternative approaches.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105556"},"PeriodicalIF":1.4,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145681888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the two-sample Behrens–Fisher problem for high-dimensional data 高维数据的双样本Behrens-Fisher问题
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2026-03-01 Epub Date: 2025-11-28 DOI: 10.1016/j.jmva.2025.105572
Yongshuai Chen , Gongming Shi , Xiaomeng Yan, Baoxue Zhang
In this paper, we study the limiting distribution of Chen-Qin’s test statistic and propose a novel weighted bootstrap test procedure for the high-dimensional two-sample Behrens–Fisher problem. We first show that the test statistic has an asymptotic null that is a mixture of a chi-square-type mixture distribution and a normal distribution, without imposing either the normal assumption or a factor-like model assumption on the underlying distributions. To gain insight into the asymptotic null distribution of the test statistic, we show that under stronger restrictions on the covariance matrices and the null hypothesis, the test statistic is either asymptotically normal or a chi-square-type mixture distribution. The power properties of the test are evaluated asymptotically under the high-dimensional local and fixed alternative hypothesis. We also derive that the proposed weighted bootstrap test procedure has correct test level asymptotically. Two simulation studies and a real data example show that the new weighted bootstrap procedure significantly outperforms other benchmarks in terms of size control and is comparable in terms of power.
本文研究了Chen-Qin检验统计量的极限分布,提出了一种新的高维双样本Behrens-Fisher问题的加权自举检验方法。我们首先证明检验统计量有一个渐近零值,它是卡方型混合分布和正态分布的混合物,而没有对底层分布施加正态假设或类因子模型假设。为了深入了解检验统计量的渐近零分布,我们表明,在对协方差矩阵和零假设的更强限制下,检验统计量要么是渐近正态分布,要么是卡方型混合分布。在高维局部定备假设下,渐近地评价了检验的幂性质。并渐近地证明了所提出的加权自举检验方法具有正确的检验水平。两个仿真研究和一个实际数据示例表明,新的加权自举过程在大小控制方面明显优于其他基准程序,并且在功率方面具有可比性。
{"title":"On the two-sample Behrens–Fisher problem for high-dimensional data","authors":"Yongshuai Chen ,&nbsp;Gongming Shi ,&nbsp;Xiaomeng Yan,&nbsp;Baoxue Zhang","doi":"10.1016/j.jmva.2025.105572","DOIUrl":"10.1016/j.jmva.2025.105572","url":null,"abstract":"<div><div>In this paper, we study the limiting distribution of Chen-Qin’s test statistic and propose a novel weighted bootstrap test procedure for the high-dimensional two-sample Behrens–Fisher problem. We first show that the test statistic has an asymptotic null that is a mixture of a chi-square-type mixture distribution and a normal distribution, without imposing either the normal assumption or a factor-like model assumption on the underlying distributions. To gain insight into the asymptotic null distribution of the test statistic, we show that under stronger restrictions on the covariance matrices and the null hypothesis, the test statistic is either asymptotically normal or a chi-square-type mixture distribution. The power properties of the test are evaluated asymptotically under the high-dimensional local and fixed alternative hypothesis. We also derive that the proposed weighted bootstrap test procedure has correct test level asymptotically. Two simulation studies and a real data example show that the new weighted bootstrap procedure significantly outperforms other benchmarks in terms of size control and is comparable in terms of power.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105572"},"PeriodicalIF":1.4,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145681890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Jump detection in single-index models with measurement error 具有测量误差的单指标模型的跳跃检测
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2026-03-01 Epub Date: 2025-11-28 DOI: 10.1016/j.jmva.2025.105574
Yuan Liu , Yan-Yong Zhao , Noriszura Ismail , Razik Ridzuan Mohd Tajuddin , Yuchun Zhang
Measurement error regression is widely used in statistical modeling. When the regression function is discontinuous, the estimation and inference have become challenging. In this paper, we develop a jump detection framework for a single index model with measurement error. First, for the single index model with measurement error, the consistent estimator of the index coefficient is obtained by using both the SIMEX (simulation extrapolation) and estimation equation methods. Then, the one-sided kernel local linear method is used to construct the estimator of the nonparametric function and the estimator of the jump point. Under some regularity assumptions, the asymptotic properties of the resultant estimators are established. The finite sample performance of our methodologies is evaluated by numerical simulation, and finally they are used to analyze the effect of serum cholesterol level and age on male blood.
测量误差回归在统计建模中得到了广泛的应用。当回归函数不连续时,估计和推理就变得很有挑战性。在本文中,我们开发了一个具有测量误差的单指标模型的跳跃检测框架。首先,针对具有测量误差的单指标模型,采用模拟外推法和估计方程法得到指标系数的一致估计量;然后,利用单侧核局部线性方法构造了非参数函数的估计量和跳点的估计量。在一些正则性假设下,给出了合成估计量的渐近性质。通过数值模拟对方法的有限样本性能进行了评价,最后应用该方法分析了血清胆固醇水平和年龄对男性血液的影响。
{"title":"Jump detection in single-index models with measurement error","authors":"Yuan Liu ,&nbsp;Yan-Yong Zhao ,&nbsp;Noriszura Ismail ,&nbsp;Razik Ridzuan Mohd Tajuddin ,&nbsp;Yuchun Zhang","doi":"10.1016/j.jmva.2025.105574","DOIUrl":"10.1016/j.jmva.2025.105574","url":null,"abstract":"<div><div>Measurement error regression is widely used in statistical modeling. When the regression function is discontinuous, the estimation and inference have become challenging. In this paper, we develop a jump detection framework for a single index model with measurement error. First, for the single index model with measurement error, the consistent estimator of the index coefficient is obtained by using both the SIMEX (simulation extrapolation) and estimation equation methods. Then, the one-sided kernel local linear method is used to construct the estimator of the nonparametric function and the estimator of the jump point. Under some regularity assumptions, the asymptotic properties of the resultant estimators are established. The finite sample performance of our methodologies is evaluated by numerical simulation, and finally they are used to analyze the effect of serum cholesterol level and age on male blood.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105574"},"PeriodicalIF":1.4,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145616473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing multivariate normality for two-level structural equation models 两级结构方程模型的多元正态性检验
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2026-03-01 Epub Date: 2025-11-28 DOI: 10.1016/j.jmva.2025.105562
Jiajuan Liang , Peter M. Bentler , Yiwen Cao
Multivariate normality is a common assumption in the maximum likelihood analysis of two-level structural equation models. Under the normal assumption, the independence condition on level-1 observations is no longer satisfied. As a result, existing statistics for testing multivariate normality based independent observations cannot be directly used for the same purpose in two-level structural equation models. In this paper we tackle this problem by employing the theory of spherical matrix distributions and some properties of invariant statistics. A series of necessary tests are constructed from some existing invariant statistics with a balanced level-1 sample design. These necessary tests are applicable without requiring a large level-1 or level-2 sample size. A Monte Carlo study is carried out to demonstrate the performance of the proposed tests in the aspects of controlling type I error rates, the power against a departure from multivariate normality for level-1 variables, and the power against a departure from multivariate normality for level-2 variables. An application of the necessary tests to a practical data set is illustrated.
多元正态性是两级结构方程模型最大似然分析中常见的假设。在正态假设下,一级观测值的独立性不再满足。因此,现有的用于检验基于独立观测的多元正态性的统计量不能直接用于两层结构方程模型的相同目的。本文利用球矩阵分布理论和不变统计的一些性质来解决这一问题。在平衡的一级样本设计下,从现有的一些不变统计量构造了一系列必要的检验。这些必要的测试适用于不需要大的一级或二级样本量。通过蒙特卡罗研究证明了所提出的测试在控制I型错误率、1级变量的抗偏离多元正态性的能力以及2级变量的抗偏离多元正态性的能力方面的性能。并举例说明了必要的测试在实际数据集上的应用。
{"title":"Testing multivariate normality for two-level structural equation models","authors":"Jiajuan Liang ,&nbsp;Peter M. Bentler ,&nbsp;Yiwen Cao","doi":"10.1016/j.jmva.2025.105562","DOIUrl":"10.1016/j.jmva.2025.105562","url":null,"abstract":"<div><div>Multivariate normality is a common assumption in the maximum likelihood analysis of two-level structural equation models. Under the normal assumption, the independence condition on level-1 observations is no longer satisfied. As a result, existing statistics for testing multivariate normality based independent observations cannot be directly used for the same purpose in two-level structural equation models. In this paper we tackle this problem by employing the theory of spherical matrix distributions and some properties of invariant statistics. A series of necessary tests are constructed from some existing invariant statistics with a balanced level-1 sample design. These necessary tests are applicable without requiring a large level-1 or level-2 sample size. A Monte Carlo study is carried out to demonstrate the performance of the proposed tests in the aspects of controlling type I error rates, the power against a departure from multivariate normality for level-1 variables, and the power against a departure from multivariate normality for level-2 variables. An application of the necessary tests to a practical data set is illustrated.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105562"},"PeriodicalIF":1.4,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145616475","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Wasserstein projection pursuit of non-Gaussian signals 非高斯信号的Wasserstein投影追踪
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2026-03-01 Epub Date: 2025-11-08 DOI: 10.1016/j.jmva.2025.105535
Satyaki Mukherjee , Soumendu Sundar Mukherjee , Debarghya Ghoshdastidar
We consider the general dimensionality reduction problem of locating in a high-dimensional data cloud, a k-dimensional non-Gaussian subspace of interesting features. We use a projection pursuit approach—we search for mutually orthogonal unit directions which maximise the q-Wasserstein distance of the empirical distribution of data-projections along these directions from a standard Gaussian. Under a generative model, where there is a underlying (unknown) low-dimensional non-Gaussian subspace, we prove rigorous statistical guarantees on the accuracy of approximating this unknown subspace by the directions found by our projection pursuit approach. Our results operate in the regime where the data dimensionality is comparable to the sample size, and thus supplement the recent literature on the non-feasibility of locating interesting directions via projection pursuit in the complementary regime where the data dimensionality is much larger than the sample size.
我们考虑一般的降维问题定位在一个高维数据云,一个k维的非高斯子空间的有趣的特征。我们使用投影追踪方法-我们搜索相互正交的单位方向,使数据的经验分布的q-Wasserstein距离最大化-沿着这些方向从标准高斯分布的投影。在生成模型下,其中存在一个潜在的(未知的)低维非高斯子空间,我们证明了通过我们的投影追踪方法找到的方向逼近该未知子空间的准确性的严格统计保证。我们的结果在数据维数与样本量相当的情况下运行,从而补充了最近关于在数据维数远大于样本量的互补情况下通过投影追踪定位感兴趣方向的不可行性的文献。
{"title":"Wasserstein projection pursuit of non-Gaussian signals","authors":"Satyaki Mukherjee ,&nbsp;Soumendu Sundar Mukherjee ,&nbsp;Debarghya Ghoshdastidar","doi":"10.1016/j.jmva.2025.105535","DOIUrl":"10.1016/j.jmva.2025.105535","url":null,"abstract":"<div><div>We consider the general dimensionality reduction problem of locating in a high-dimensional data cloud, a <span><math><mi>k</mi></math></span>-dimensional non-Gaussian subspace of interesting features. We use a projection pursuit approach—we search for mutually orthogonal unit directions which maximise the <span><math><mi>q</mi></math></span>-Wasserstein distance of the empirical distribution of data-projections along these directions from a standard Gaussian. Under a generative model, where there is a underlying (unknown) low-dimensional non-Gaussian subspace, we prove rigorous statistical guarantees on the accuracy of approximating this unknown subspace by the directions found by our projection pursuit approach. Our results operate in the regime where the data dimensionality is comparable to the sample size, and thus supplement the recent literature on the non-feasibility of locating interesting directions via projection pursuit in the complementary regime where the data dimensionality is much larger than the sample size.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105535"},"PeriodicalIF":1.4,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145537558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonlinear functional principal component analysis using neural networks 基于神经网络的非线性泛函主成分分析
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2026-03-01 Epub Date: 2025-11-15 DOI: 10.1016/j.jmva.2025.105526
Rou Zhong , Jingxiao Zhang , Chunming Zhang
Functional principal component analysis (FPCA) is an important technique for dimension reduction in functional data analysis (FDA). Classical FPCA method is based on the Karhunen-Loève expansion, which assumes a linear structure of the observed functional data. However, the assumption may not always be satisfied, and the FPCA method can become inefficient when the data deviates from the linear assumption. In this paper, we propose a novel FPCA method that is suitable for data with a nonlinear structure with the use of neural networks. We construct networks that can be applied to functional data and explore the corresponding universal approximation property. The main use of our proposed nonlinear FPCA method is curve reconstruction. We conduct a simulation study to evaluate the performance of our method. The proposed method is also applied to a real-world data set to further demonstrate its superiority.
功能主成分分析(FPCA)是功能数据分析(FDA)中重要的降维技术。经典的FPCA方法基于karhunen - lo展开式,它假定观测到的函数数据具有线性结构。然而,假设并不总是满足的,当数据偏离线性假设时,FPCA方法会变得低效。在本文中,我们提出了一种新的FPCA方法,该方法适用于具有非线性结构的数据,并使用神经网络。我们构建了可以应用于函数数据的网络,并探索了相应的普遍逼近性质。本文提出的非线性FPCA方法的主要用途是曲线重构。我们进行了模拟研究来评估我们的方法的性能。并将该方法应用于实际数据集,进一步证明了其优越性。
{"title":"Nonlinear functional principal component analysis using neural networks","authors":"Rou Zhong ,&nbsp;Jingxiao Zhang ,&nbsp;Chunming Zhang","doi":"10.1016/j.jmva.2025.105526","DOIUrl":"10.1016/j.jmva.2025.105526","url":null,"abstract":"<div><div>Functional principal component analysis (FPCA) is an important technique for dimension reduction in functional data analysis (FDA). Classical FPCA method is based on the Karhunen-Loève expansion, which assumes a linear structure of the observed functional data. However, the assumption may not always be satisfied, and the FPCA method can become inefficient when the data deviates from the linear assumption. In this paper, we propose a novel FPCA method that is suitable for data with a nonlinear structure with the use of neural networks. We construct networks that can be applied to functional data and explore the corresponding universal approximation property. The main use of our proposed nonlinear FPCA method is curve reconstruction. We conduct a simulation study to evaluate the performance of our method. The proposed method is also applied to a real-world data set to further demonstrate its superiority.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"212 ","pages":"Article 105526"},"PeriodicalIF":1.4,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145537559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Multivariate Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1