首页 > 最新文献

Journal of Multivariate Analysis最新文献

英文 中文
A sparse dimension-reduced subspace-based approach for detecting multiple change points in high-dimensional data 一种基于稀疏降维子空间的高维数据多变化点检测方法
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-12-27 DOI: 10.1016/j.jmva.2025.105594
Luoyao Yu , Rongzhu Zhao , Jiaqi Huang , Lixing Zhu , Xuehu Zhu
This paper develops a novel penalized matrix estimation method for sparse dimension reduction when detecting change points in high-dimensional data. The strategy is to project high-dimensional data onto a low-dimensional subspace without losing any change point information, enabling efficient change point detection within this dimension-reduced subspace. Theoretical analysis establishes the consistency of the proposed matrix estimation and selects consistently the important variables which have change points. Numerical studies on synthetic and several real data sets suggest that the dimension reduction strategy enhances the performance of existing approaches. Additionally, the results showcase the efficiency of the proposed algorithm for selecting important variables in high-dimensional sparse data.
针对高维数据中变化点的稀疏降维问题,提出了一种新的惩罚矩阵估计方法。该策略是将高维数据投影到低维子空间中,而不会丢失任何更改点信息,从而在该降维子空间中实现有效的更改点检测。理论分析建立了所提出的矩阵估计的一致性,并一致地选择了具有变化点的重要变量。对合成数据集和一些实际数据集的数值研究表明,降维策略提高了现有方法的性能。此外,实验结果表明了该算法在高维稀疏数据中选取重要变量的有效性。
{"title":"A sparse dimension-reduced subspace-based approach for detecting multiple change points in high-dimensional data","authors":"Luoyao Yu ,&nbsp;Rongzhu Zhao ,&nbsp;Jiaqi Huang ,&nbsp;Lixing Zhu ,&nbsp;Xuehu Zhu","doi":"10.1016/j.jmva.2025.105594","DOIUrl":"10.1016/j.jmva.2025.105594","url":null,"abstract":"<div><div>This paper develops a novel penalized matrix estimation method for sparse dimension reduction when detecting change points in high-dimensional data. The strategy is to project high-dimensional data onto a low-dimensional subspace without losing any change point information, enabling efficient change point detection within this dimension-reduced subspace. Theoretical analysis establishes the consistency of the proposed matrix estimation and selects consistently the important variables which have change points. Numerical studies on synthetic and several real data sets suggest that the dimension reduction strategy enhances the performance of existing approaches. Additionally, the results showcase the efficiency of the proposed algorithm for selecting important variables in high-dimensional sparse data.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"213 ","pages":"Article 105594"},"PeriodicalIF":1.4,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145880915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Subgroup effect quantile regression with high dimensional missing panel data 高维缺失面板数据的亚组效应分位数回归
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-12-23 DOI: 10.1016/j.jmva.2025.105593
Shu-Yu Li, Han-Ying Liang
Based on panel data, we explore partially linear varying-coefficient quantile regression with group effects under high dimension and missing observations. Using generalized estimating equations, we construct oracle estimators along with smoothed version for the unknown parameter vector, varying-coefficient functions as well as group effects, and establish their asymptotic normality. In the estimation procedure, the within-subject correlations of the panel data are considered by introducing working correlation matrix. We further investigate variable selection by the SCAD penalty for the parameters, varying-coefficient functions and group identification simultaneously, and discuss oracle properties. Meanwhile, hypothesis tests for the parameter, varying-coefficient functions and group effects are done, asymptotic distributions of the restricted estimators and test statistics under both the null and local alternative hypotheses are analyzed. Also, simulation study and real data analysis are conducted to evaluate the performance of the proposed methods.
基于面板数据,我们探索了在高维和缺失观测值下具有组效应的部分线性变系数分位数回归。利用广义估计方程,构造了未知参数向量、变系数函数和群效应的oracle估计量及其光滑版本,并建立了它们的渐近正态性。在估计过程中,通过引入工作相关矩阵来考虑面板数据的主体内相关性。我们进一步研究了SCAD惩罚对参数、变系数函数和组识别的变量选择,并讨论了oracle的性质。同时,对参数函数、变系数函数和群效应进行了假设检验,分析了零假设和局部备用假设下的限制估计量和检验统计量的渐近分布。通过仿真研究和实际数据分析,对所提方法的性能进行了评价。
{"title":"Subgroup effect quantile regression with high dimensional missing panel data","authors":"Shu-Yu Li,&nbsp;Han-Ying Liang","doi":"10.1016/j.jmva.2025.105593","DOIUrl":"10.1016/j.jmva.2025.105593","url":null,"abstract":"<div><div>Based on panel data, we explore partially linear varying-coefficient quantile regression with group effects under high dimension and missing observations. Using generalized estimating equations, we construct oracle estimators along with smoothed version for the unknown parameter vector, varying-coefficient functions as well as group effects, and establish their asymptotic normality. In the estimation procedure, the within-subject correlations of the panel data are considered by introducing working correlation matrix. We further investigate variable selection by the SCAD penalty for the parameters, varying-coefficient functions and group identification simultaneously, and discuss oracle properties. Meanwhile, hypothesis tests for the parameter, varying-coefficient functions and group effects are done, asymptotic distributions of the restricted estimators and test statistics under both the null and local alternative hypotheses are analyzed. Also, simulation study and real data analysis are conducted to evaluate the performance of the proposed methods.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"213 ","pages":"Article 105593"},"PeriodicalIF":1.4,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145837542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Envelope-based partial least squares in functional regression 函数回归中基于包络的偏最小二乘
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-12-23 DOI: 10.1016/j.jmva.2025.105592
Minxuan Wu , Joseph Antonelli , Zhihua Su
In this article, we extend predictor envelope models to settings with multivariate outcomes and multiple, functional predictors. We propose a two-step estimation strategy, which first projects the function onto a finite-dimensional Euclidean space before fitting the model using existing approaches to envelope models. We first develop an estimator under a linear model with continuous outcomes and then extend this procedure to the more general class of generalized linear models, which allow for a variety of outcome types. We provide asymptotic theory for these estimators showing that they are root-n consistent and asymptotically normal when the regression coefficient is finite-rank. Additionally we show that consistency can be obtained even when the regression coefficient has rank that grows with the sample size. Extensive simulation studies confirm our theoretical results and show strong prediction performance of the proposed estimators. Additionally, we provide multiple data analyses showing that the proposed approach performs well in real-world settings under a variety of outcome types compared with existing dimension reduction approaches.
在本文中,我们将预测包络模型扩展到具有多变量结果和多个功能预测因子的设置。我们提出了一种两步估计策略,首先将函数投影到有限维欧几里得空间,然后使用现有的包络模型方法拟合模型。我们首先在具有连续结果的线性模型下开发一个估计量,然后将此过程扩展到更一般的广义线性模型类,它允许各种结果类型。我们给出了这些估计量的渐近理论,表明当回归系数为有限秩时,它们是根n一致且渐近正态的。此外,我们还表明,即使回归系数的秩随着样本量的增加而增加,也可以获得一致性。大量的模拟研究证实了我们的理论结果,并显示了所提出的估计器的强大预测性能。此外,我们提供的多个数据分析表明,与现有的降维方法相比,所提出的方法在各种结果类型的现实环境中表现良好。
{"title":"Envelope-based partial least squares in functional regression","authors":"Minxuan Wu ,&nbsp;Joseph Antonelli ,&nbsp;Zhihua Su","doi":"10.1016/j.jmva.2025.105592","DOIUrl":"10.1016/j.jmva.2025.105592","url":null,"abstract":"<div><div>In this article, we extend predictor envelope models to settings with multivariate outcomes and multiple, functional predictors. We propose a two-step estimation strategy, which first projects the function onto a finite-dimensional Euclidean space before fitting the model using existing approaches to envelope models. We first develop an estimator under a linear model with continuous outcomes and then extend this procedure to the more general class of generalized linear models, which allow for a variety of outcome types. We provide asymptotic theory for these estimators showing that they are root-<span><math><mi>n</mi></math></span> consistent and asymptotically normal when the regression coefficient is finite-rank. Additionally we show that consistency can be obtained even when the regression coefficient has rank that grows with the sample size. Extensive simulation studies confirm our theoretical results and show strong prediction performance of the proposed estimators. Additionally, we provide multiple data analyses showing that the proposed approach performs well in real-world settings under a variety of outcome types compared with existing dimension reduction approaches.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"213 ","pages":"Article 105592"},"PeriodicalIF":1.4,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145837543","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AUK-based test for mutual independence and an index of mutual dependence 基于auc的相互独立测试和相互依赖指数
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-12-22 DOI: 10.1016/j.jmva.2025.105589
Georgios Afendras , Marianthi Markatou , Nickos Papantonis
We offer a novel test of mutual independence based on consistent estimates of the area under the Kendall curve. We also present an index of dependence that allows one to measure the mutual dependence of a d-dimensional random vector with d>2. The index is based on a d-dimensional Kendall process. We discuss a standardized version of our index of dependence that is easy to interpret, and provide an algorithm for its computation. Based on the proposed index of dependence, we exemplify a novel method for searching for patterns in the dependence structure. We evaluate the performance of our procedures via simulation, and apply our methods to a real data set.
我们提供了一种基于肯德尔曲线下面积的一致估计的相互独立性的新测试。我们还提出了一个依赖指数,它允许人们用d>;2来测量d维随机向量的相互依赖性。该指数基于一个d维肯德尔过程。我们讨论了一个易于解释的依赖指数的标准化版本,并提供了一个计算它的算法。基于提出的依赖指数,给出了一种新的依赖结构模式搜索方法。我们通过模拟来评估我们的程序的性能,并将我们的方法应用于真实的数据集。
{"title":"AUK-based test for mutual independence and an index of mutual dependence","authors":"Georgios Afendras ,&nbsp;Marianthi Markatou ,&nbsp;Nickos Papantonis","doi":"10.1016/j.jmva.2025.105589","DOIUrl":"10.1016/j.jmva.2025.105589","url":null,"abstract":"<div><div>We offer a novel test of mutual independence based on consistent estimates of the area under the Kendall curve. We also present an index of dependence that allows one to measure the mutual dependence of a <span><math><mi>d</mi></math></span>-dimensional random vector with <span><math><mrow><mi>d</mi><mo>&gt;</mo><mn>2</mn></mrow></math></span>. The index is based on a <span><math><mi>d</mi></math></span>-dimensional Kendall process. We discuss a standardized version of our index of dependence that is easy to interpret, and provide an algorithm for its computation. Based on the proposed index of dependence, we exemplify a novel method for searching for patterns in the dependence structure. We evaluate the performance of our procedures via simulation, and apply our methods to a real data set.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"214 ","pages":"Article 105589"},"PeriodicalIF":1.4,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146170929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic sparse estimation of the high-dimensional cross-covariance matrix 高维交叉协方差矩阵的自动稀疏估计
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-12-19 DOI: 10.1016/j.jmva.2025.105590
Tetsuya Umino , Kazuyoshi Yata , Makoto Aoshima
Scenarios involving high-dimensional, low-sample-size (HDLSS) data are often encountered in modern scientific fields involving genetic microarrays, medical imaging, and finance, where the number of variables can greatly exceed the number of observations. In such settings, a reliable estimation of cross-covariance structures is essential for understanding relationships between variable sets. However, classical estimators often exhibit severe noise accumulation. To address this issue, in this study, we propose a novel thresholding estimator of the cross-covariance matrix for HDLSS settings. We consider the asymptotic properties of the sample cross-covariance matrix and show that the estimator contains large amounts of noise in the high-dimensional setting, which renders it inconsistent. To solve this problem occurring in high-dimensional settings, we develop a new thresholding estimator based on the automatic sparse estimation methodology and show that the estimator is consistent under mild assumptions. We analyze and evaluate the performance of the proposed estimator based on numerical simulations and actual data analysis. The simulations demonstrate that the method attains consistency without requiring the stringent high-dimensional conditions assumed by existing approaches, and the real-data analysis illustrates its applicability to high-dimensional regression problems, wherein improved parameter estimation enhances prediction accuracy. In conclusion, our findings serve as a theoretically sound tool for cross-covariance estimation in HDLSS contexts, with potential implications for a wide range of high-dimensional data analyses.
涉及高维、低样本大小(HDLSS)数据的场景经常在涉及基因微阵列、医学成像和金融的现代科学领域中遇到,其中变量的数量可能大大超过观察的数量。在这种情况下,对交叉协方差结构的可靠估计对于理解变量集之间的关系至关重要。然而,经典估计器往往表现出严重的噪声积累。为了解决这个问题,在本研究中,我们提出了一种新的HDLSS设置的交叉协方差矩阵阈值估计器。我们考虑样本交叉协方差矩阵的渐近性质,并表明估计量在高维设置中包含大量的噪声,这使得它不一致。为了解决高维环境中出现的这一问题,我们开发了一种新的基于自动稀疏估计方法的阈值估计器,并证明了该估计器在温和假设下是一致的。基于数值模拟和实际数据分析,对所提估计器的性能进行了分析和评价。仿真结果表明,该方法不需要现有方法所假定的严格的高维条件,即可达到一致性;实际数据分析表明,该方法适用于高维回归问题,其中改进的参数估计提高了预测精度。总之,我们的研究结果可以作为HDLSS背景下交叉协方差估计的理论可靠工具,对广泛的高维数据分析具有潜在的影响。
{"title":"Automatic sparse estimation of the high-dimensional cross-covariance matrix","authors":"Tetsuya Umino ,&nbsp;Kazuyoshi Yata ,&nbsp;Makoto Aoshima","doi":"10.1016/j.jmva.2025.105590","DOIUrl":"10.1016/j.jmva.2025.105590","url":null,"abstract":"<div><div>Scenarios involving high-dimensional, low-sample-size (HDLSS) data are often encountered in modern scientific fields involving genetic microarrays, medical imaging, and finance, where the number of variables can greatly exceed the number of observations. In such settings, a reliable estimation of cross-covariance structures is essential for understanding relationships between variable sets. However, classical estimators often exhibit severe noise accumulation. To address this issue, in this study, we propose a novel thresholding estimator of the cross-covariance matrix for HDLSS settings. We consider the asymptotic properties of the sample cross-covariance matrix and show that the estimator contains large amounts of noise in the high-dimensional setting, which renders it inconsistent. To solve this problem occurring in high-dimensional settings, we develop a new thresholding estimator based on the automatic sparse estimation methodology and show that the estimator is consistent under mild assumptions. We analyze and evaluate the performance of the proposed estimator based on numerical simulations and actual data analysis. The simulations demonstrate that the method attains consistency without requiring the stringent high-dimensional conditions assumed by existing approaches, and the real-data analysis illustrates its applicability to high-dimensional regression problems, wherein improved parameter estimation enhances prediction accuracy. In conclusion, our findings serve as a theoretically sound tool for cross-covariance estimation in HDLSS contexts, with potential implications for a wide range of high-dimensional data analyses.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"213 ","pages":"Article 105590"},"PeriodicalIF":1.4,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145837541","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low dimensional factor model-based tests for assessing vector correlation in high-dimensional settings 用于评估高维设置中向量相关性的基于低维因素模型的测试
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-12-17 DOI: 10.1016/j.jmva.2025.105588
Masashi Hyodo , Takahiro Nishiyama , Shoichi Narita
This study proposes a new test for vector correlation in a high-dimensional framework, while accommodating a low-dimensional latent factor model. Our test, built under low-dimensional factor models, distinguishes from previous normal approximation-based tests, which are valid under a weak-spike structure. We propose a modified RV coefficient for high-dimensional data, and show that its null-limiting distributions follow a weighted mixture of chi-square distributions under a high-dimensional asymptotic regime integrated with weak technical conditions. By applying this asymptotic result and estimation theory of the number of factors in a low-dimensional factor model, we propose a new approximation test for vector correlations. We also derive the asymptotic power function for the proposed test. Lastly, we examine the finite sample and dimensional performance of this test using Monte Carlo simulations.
本研究提出了一个新的测试在高维框架下的向量相关性,同时适应低维潜在因素模型。我们的测试是在低维因子模型下构建的,与之前基于正常近似的测试不同,这些测试在弱尖峰结构下有效。我们提出了一个改进的高维数据的RV系数,并证明了它的零极限分布遵循一个加权的混合卡方分布,在高维渐近状态下与弱技术条件相结合。利用这一渐近结果和低维因子模型中因子数的估计理论,我们提出了一种新的向量相关性的近似检验方法。我们也推导了渐近幂函数。最后,我们使用蒙特卡罗模拟来检验该测试的有限样本和维度性能。
{"title":"Low dimensional factor model-based tests for assessing vector correlation in high-dimensional settings","authors":"Masashi Hyodo ,&nbsp;Takahiro Nishiyama ,&nbsp;Shoichi Narita","doi":"10.1016/j.jmva.2025.105588","DOIUrl":"10.1016/j.jmva.2025.105588","url":null,"abstract":"<div><div>This study proposes a new test for vector correlation in a high-dimensional framework, while accommodating a low-dimensional latent factor model. Our test, built under low-dimensional factor models, distinguishes from previous normal approximation-based tests, which are valid under a weak-spike structure. We propose a modified RV coefficient for high-dimensional data, and show that its null-limiting distributions follow a weighted mixture of chi-square distributions under a high-dimensional asymptotic regime integrated with weak technical conditions. By applying this asymptotic result and estimation theory of the number of factors in a low-dimensional factor model, we propose a new approximation test for vector correlations. We also derive the asymptotic power function for the proposed test. Lastly, we examine the finite sample and dimensional performance of this test using Monte Carlo simulations.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"213 ","pages":"Article 105588"},"PeriodicalIF":1.4,"publicationDate":"2025-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145837540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiplier and empirical subsample bootstraps for maxima in high dimensional time series analysis 高维时间序列分析中最大值的乘子和经验子样本自举
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-12-16 DOI: 10.1016/j.jmva.2025.105579
Ruru Ma, Shibin Zhang
The distribution of maxima is crucial for simultaneous inference of high dimensional parameters. This paper focuses on the bootstrap approximation to the distribution of maxima in high dimensional time series analysis. We propose two novel approaches, the multiplier subsample bootstrap (MSB) and the empirical subsample bootstrap (ESB), to approximate the distribution of maxima constructed from high dimensional time series. Both approaches utilize block-based subsample statistics to build the bootstrap statistics. The MSB assigns weights to block-based subsample statistics using random elements, while the ESB resamples from them independently and uniformly. Under certain regularity conditions, we establish the asymptotic validity of the two proposed approaches, when the parameter dimension is large or even much larger than the sample size. A simulation study demonstrates that both the MSB and ESB perform well.
极大值的分布对于高维参数的同时推断是至关重要的。研究了高维时间序列分析中极大值分布的自举逼近方法。我们提出了两种新的方法,乘子样本bootstrap (MSB)和经验子样本bootstrap (ESB),以近似高维时间序列构造的最大值的分布。这两种方法都利用基于块的子样本统计来构建自举统计。MSB使用随机元素为基于块的子样本统计信息分配权重,而ESB则独立且统一地从中重新采样。在一定的正则性条件下,当参数维数大于甚至远远大于样本容量时,我们建立了这两种方法的渐近有效性。仿真研究表明,MSB和ESB都具有良好的性能。
{"title":"Multiplier and empirical subsample bootstraps for maxima in high dimensional time series analysis","authors":"Ruru Ma,&nbsp;Shibin Zhang","doi":"10.1016/j.jmva.2025.105579","DOIUrl":"10.1016/j.jmva.2025.105579","url":null,"abstract":"<div><div>The distribution of maxima is crucial for simultaneous inference of high dimensional parameters. This paper focuses on the bootstrap approximation to the distribution of maxima in high dimensional time series analysis. We propose two novel approaches, the multiplier subsample bootstrap (MSB) and the empirical subsample bootstrap (ESB), to approximate the distribution of maxima constructed from high dimensional time series. Both approaches utilize block-based subsample statistics to build the bootstrap statistics. The MSB assigns weights to block-based subsample statistics using random elements, while the ESB resamples from them independently and uniformly. Under certain regularity conditions, we establish the asymptotic validity of the two proposed approaches, when the parameter dimension is large or even much larger than the sample size. A simulation study demonstrates that both the MSB and ESB perform well.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"213 ","pages":"Article 105579"},"PeriodicalIF":1.4,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145788052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A method for sparse and robust independent component analysis 稀疏鲁棒独立分量分析方法
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-12-10 DOI: 10.1016/j.jmva.2025.105587
Lauri Heinonen, Joni Virta
This work presents sparse invariant coordinate selection, SICS, a new method for sparse and robust independent component analysis. SICS is based on classical invariant coordinate selection, which is presented in such a form that a LASSO-type penalty can be applied to promote sparsity. Robustness is achieved by using robust scatter matrices. In the first part of the paper, the background and building blocks: scatter matrices, measures of robustness, ICS and independent component analysis, are carefully introduced. Then the proposed new method and its algorithm are derived and presented. This part also includes consistency and breakdown point results for a general case of sparse ICS-like methods. The performance of SICS in identifying sparse independent component loadings is investigated with multiple simulations. The method is illustrated with an example in constructing sparse causal graphs and we also propose a graphical tool for selecting the appropriate sparsity level in SICS.
本文提出了稀疏不变坐标选择方法(SICS),一种稀疏鲁棒独立分量分析的新方法。SICS基于经典的不变坐标选择,它以lasso类型的惩罚可以应用于提高稀疏性的形式呈现。鲁棒性是通过使用鲁棒散点矩阵实现的。在论文的第一部分,详细介绍了背景和构建模块:散点矩阵、鲁棒性度量、ICS和独立成分分析。然后推导并给出了新方法及其算法。这一部分还包括稀疏类ics方法的一般情况下的一致性和故障点结果。通过多个仿真研究了稀疏独立分量载荷识别的性能。该方法以构建稀疏因果图为例进行了说明,并提出了一种图形化工具来选择合适的稀疏度级别。
{"title":"A method for sparse and robust independent component analysis","authors":"Lauri Heinonen,&nbsp;Joni Virta","doi":"10.1016/j.jmva.2025.105587","DOIUrl":"10.1016/j.jmva.2025.105587","url":null,"abstract":"<div><div>This work presents sparse invariant coordinate selection, SICS, a new method for sparse and robust independent component analysis. SICS is based on classical invariant coordinate selection, which is presented in such a form that a LASSO-type penalty can be applied to promote sparsity. Robustness is achieved by using robust scatter matrices. In the first part of the paper, the background and building blocks: scatter matrices, measures of robustness, ICS and independent component analysis, are carefully introduced. Then the proposed new method and its algorithm are derived and presented. This part also includes consistency and breakdown point results for a general case of sparse ICS-like methods. The performance of SICS in identifying sparse independent component loadings is investigated with multiple simulations. The method is illustrated with an example in constructing sparse causal graphs and we also propose a graphical tool for selecting the appropriate sparsity level in SICS.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"213 ","pages":"Article 105587"},"PeriodicalIF":1.4,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145788051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simultaneous heterogeneity and reduced-rank learning for multivariate response regression 多元反应回归的同时异质性与降阶学习
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-12-05 DOI: 10.1016/j.jmva.2025.105578
Jie Wu , Bo Zhang , Daoji Li , Zemin Zheng
Heterogeneous data are now ubiquitous in many applications in which correctly identifying the subgroups from a heterogeneous population is critical. Although there is an increasing body of literature on subgroup detection, existing methods mainly focus on the univariate response setting. In this paper, we propose a joint heterogeneity and reduced-rank learning framework to simultaneously identify the subgroup structure and estimate the covariate effects for heterogeneous multivariate response regression. In particular, our approach uses rank-constrained pairwise fusion penalization and conducts the subgroup analysis without requiring prior knowledge regarding the individual subgroup memberships. We implement the proposed approach by an alternating direction method of multipliers (ADMM) algorithm and show its convergence. We also establish the asymptotic properties for the resulting estimators under mild and interpretable conditions. A predictive information criterion is proposed to select the rank of the coefficient matrix with theoretical support. The effectiveness of the proposed approach is demonstrated through simulation studies and a real data application.
异构数据现在在许多应用程序中无处不在,在这些应用程序中,从异构种群中正确识别子组是至关重要的。虽然亚群检测的文献越来越多,但现有的方法主要集中在单变量响应设置上。在本文中,我们提出了一个联合异质性和降阶学习框架,以同时识别子群结构和估计异质性多元响应回归的协变量效应。特别是,我们的方法使用了等级约束的两两融合惩罚,并在不需要关于单个子组成员的先验知识的情况下进行子组分析。我们用交替方向乘法器(ADMM)算法实现了该方法,并证明了其收敛性。在温和和可解释的条件下,我们还建立了所得估计量的渐近性质。提出了一种选择系数矩阵秩的预测信息准则,并提供了理论支持。仿真研究和实际数据应用验证了该方法的有效性。
{"title":"Simultaneous heterogeneity and reduced-rank learning for multivariate response regression","authors":"Jie Wu ,&nbsp;Bo Zhang ,&nbsp;Daoji Li ,&nbsp;Zemin Zheng","doi":"10.1016/j.jmva.2025.105578","DOIUrl":"10.1016/j.jmva.2025.105578","url":null,"abstract":"<div><div>Heterogeneous data are now ubiquitous in many applications in which correctly identifying the subgroups from a heterogeneous population is critical. Although there is an increasing body of literature on subgroup detection, existing methods mainly focus on the univariate response setting. In this paper, we propose a joint heterogeneity and reduced-rank learning framework to simultaneously identify the subgroup structure and estimate the covariate effects for heterogeneous multivariate response regression. In particular, our approach uses rank-constrained pairwise fusion penalization and conducts the subgroup analysis without requiring prior knowledge regarding the individual subgroup memberships. We implement the proposed approach by an alternating direction method of multipliers (ADMM) algorithm and show its convergence. We also establish the asymptotic properties for the resulting estimators under mild and interpretable conditions. A predictive information criterion is proposed to select the rank of the coefficient matrix with theoretical support. The effectiveness of the proposed approach is demonstrated through simulation studies and a real data application.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"213 ","pages":"Article 105578"},"PeriodicalIF":1.4,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145683762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A robust mixed functional classifier with adaptive large margin loss 一种自适应大边际损失鲁棒混合函数分类器
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-12-03 DOI: 10.1016/j.jmva.2025.105563
Hanteng Ma , Peijun Sang , Xingdong Feng , Xin Liu
Functional classification has been increasingly helpful in exploring and predicting a response variable with multiple categories. In fact, both functional and scalar covariates may be useful and should be included in the model simultaneously, and thus developing a robust multi-categorical functional classifier with statistical guarantees is desirable. However, both of these two issues are rarely touched in previous studies. Motivated by these, in this paper we propose a novel large margin linear mixed functional classifier for the response with multiple categories, which includes both functional and scalar covariates as predictors, especially when functional data are sparsely longitudinal. Not only does the proposed method address the functional classification using a combination of both functional and scalar covariates, but also provides a robust multi-categorical mixed functional classifier using a large margin loss adaptive to observed samples. Furthermore, we establish statistical theories of a mixed functional classifier, which have been less considered in existing literature. An efficient algorithm is also proposed for its practical implementation. Numerical investigations have supported the superb performance of the proposed method on both simulated and real datasets.
功能分类在探索和预测具有多类别的反应变量方面越来越有帮助。事实上,函数协变量和标量协变量可能都是有用的,应该同时包含在模型中,因此开发具有统计保证的鲁棒多分类函数分类器是可取的。然而,这两个问题在以往的研究中都很少涉及。受此启发,本文提出了一种新的大余量线性混合泛函分类器,用于具有多类别的响应,其中包括函数和标量协变量作为预测因子,特别是当函数数据是稀疏纵向的时。该方法不仅解决了使用泛函协变量和标量协变量组合的功能分类问题,而且还提供了一个鲁棒的多类别混合功能分类器,该分类器使用自适应观察样本的大裕度损失。此外,我们建立了混合功能分类器的统计理论,这在现有文献中很少被考虑。为实现该算法,提出了一种有效的算法。数值研究支持了该方法在模拟和实际数据集上的优异性能。
{"title":"A robust mixed functional classifier with adaptive large margin loss","authors":"Hanteng Ma ,&nbsp;Peijun Sang ,&nbsp;Xingdong Feng ,&nbsp;Xin Liu","doi":"10.1016/j.jmva.2025.105563","DOIUrl":"10.1016/j.jmva.2025.105563","url":null,"abstract":"<div><div>Functional classification has been increasingly helpful in exploring and predicting a response variable with multiple categories. In fact, both functional and scalar covariates may be useful and should be included in the model simultaneously, and thus developing a robust multi-categorical functional classifier with statistical guarantees is desirable. However, both of these two issues are rarely touched in previous studies. Motivated by these, in this paper we propose a novel large margin linear mixed functional classifier for the response with multiple categories, which includes both functional and scalar covariates as predictors, especially when functional data are sparsely longitudinal. Not only does the proposed method address the functional classification using a combination of both functional and scalar covariates, but also provides a robust multi-categorical mixed functional classifier using a large margin loss adaptive to observed samples. Furthermore, we establish statistical theories of a mixed functional classifier, which have been less considered in existing literature. An efficient algorithm is also proposed for its practical implementation. Numerical investigations have supported the superb performance of the proposed method on both simulated and real datasets.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"213 ","pages":"Article 105563"},"PeriodicalIF":1.4,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145735425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Multivariate Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1