首页 > 最新文献

Journal of Multivariate Analysis最新文献

英文 中文
Estimating singular functions of kernel cross-covariance operators: An investigation of the Nyström method 估计核交叉协方差算子的奇异函数:Nyström方法的研究
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-10-10 DOI: 10.1016/j.jmva.2025.105514
Min Xu , Qi-Hang Zhou , Qin Fang , Zhuo-Xi Shi
We investigate the Nyström method as an efficient means of overcoming the computational bottleneck inherent in estimating the singular functions of kernel cross-covariance operators, which play a central role in tasks such as covariate shift correction and multi-view learning. We present a Nyström-type approximation of the kernel cross-covariance operator, and establish its convergence rate. Furthermore, we derive a novel bound on the weighted sum of squared estimation errors of all associated singular functions, providing tighter control than traditional bounds that treat each error individually. Our theoretical analysis reveals that the Nyström-based singular function estimators attain the same statistical accuracy as their full empirical counterparts, while offering significant computational savings. Numerical experiments further confirm the practical effectiveness of the proposed approach.
我们研究了Nyström方法作为克服核交叉协方差算子奇异函数估计固有的计算瓶颈的有效手段,这在协变量移位校正和多视图学习等任务中起着核心作用。给出了核交叉协方差算子的Nyström-type近似,并确定了其收敛速度。此外,我们推导了所有相关奇异函数的加权平方和估计误差的新界,比单独处理每个误差的传统界提供了更严格的控制。我们的理论分析表明,Nyström-based奇异函数估计器获得与完全经验对应的相同的统计精度,同时提供显着的计算节省。数值实验进一步验证了该方法的实用性。
{"title":"Estimating singular functions of kernel cross-covariance operators: An investigation of the Nyström method","authors":"Min Xu ,&nbsp;Qi-Hang Zhou ,&nbsp;Qin Fang ,&nbsp;Zhuo-Xi Shi","doi":"10.1016/j.jmva.2025.105514","DOIUrl":"10.1016/j.jmva.2025.105514","url":null,"abstract":"<div><div>We investigate the Nyström method as an efficient means of overcoming the computational bottleneck inherent in estimating the singular functions of kernel cross-covariance operators, which play a central role in tasks such as covariate shift correction and multi-view learning. We present a Nyström-type approximation of the kernel cross-covariance operator, and establish its convergence rate. Furthermore, we derive a novel bound on the weighted sum of squared estimation errors of all associated singular functions, providing tighter control than traditional bounds that treat each error individually. Our theoretical analysis reveals that the Nyström-based singular function estimators attain the same statistical accuracy as their full empirical counterparts, while offering significant computational savings. Numerical experiments further confirm the practical effectiveness of the proposed approach.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"211 ","pages":"Article 105514"},"PeriodicalIF":1.4,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145266689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model-based Fréchet regression in (quotient) metric spaces with a focus on elastic curves (商)度量空间中基于模型的frsamchet回归,重点是弹性曲线
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-10-08 DOI: 10.1016/j.jmva.2025.105515
Lisa Steyer , Almond Stöcker , Sonja Greven , Alzheimer’s Disease Neuroimaging Initiative
We introduce model-based Fréchet regression in metric spaces. Instead of starting from point-wise conditional Fréchet means, our approach is defined as a constrained minimization problem over a model class of functions. The approach is then applied to develop a general framework of regression for quotient metric spaces with distances induced by isometric group actions. Such spaces arise naturally in applications where objects are considered equivalent up to transformations. We first establish general existence and consistency results for model-based Fréchet regression, with our quotient space regression model as a special case. As an important example we consider regression for elastic curves in the square-root velocity framework. This addresses data such as handwritten letters, movement paths, or outlines of objects, where only the image but not the parametrization of the curves is of interest. To handle sparsely or irregularly sampled curves, we model smooth conditional mean curves using splines. We validate our approach through simulations and an application to hippocampal outlines extracted from Magnetic Resonance Imaging scans. Here we model how the shape of the irregularly sampled hippocampus is related to age, Alzheimer’s disease and sex, to disentangle the shrinking effects of Alzheimer’s from normal aging.
我们在度量空间中引入了基于模型的fracimchet回归。我们的方法被定义为一个函数模型类上的约束最小化问题,而不是从逐点的条件法开始。然后将该方法应用于开发具有等距群作用诱导距离的商度量空间的一般回归框架。这样的空间自然出现在对象被认为等同于转换的应用程序中。首先,我们以商空间回归模型为特例,建立了基于模型的frachimet回归的一般存在性和一致性结果。作为一个重要的例子,我们考虑了平方根速度框架下弹性曲线的回归。这处理诸如手写字母、移动路径或物体轮廓之类的数据,在这些数据中,只有图像而不是曲线的参数化感兴趣。为了处理稀疏或不规则采样曲线,我们使用样条对光滑条件平均曲线建模。我们通过模拟和应用从磁共振成像扫描中提取的海马轮廓来验证我们的方法。在这里,我们对不规则取样的海马体的形状与年龄、阿尔茨海默病和性别之间的关系进行了建模,以解开阿尔茨海默病与正常衰老之间的萎缩效应。
{"title":"Model-based Fréchet regression in (quotient) metric spaces with a focus on elastic curves","authors":"Lisa Steyer ,&nbsp;Almond Stöcker ,&nbsp;Sonja Greven ,&nbsp;Alzheimer’s Disease Neuroimaging Initiative","doi":"10.1016/j.jmva.2025.105515","DOIUrl":"10.1016/j.jmva.2025.105515","url":null,"abstract":"<div><div>We introduce model-based Fréchet regression in metric spaces. Instead of starting from point-wise conditional Fréchet means, our approach is defined as a constrained minimization problem over a model class of functions. The approach is then applied to develop a general framework of regression for quotient metric spaces with distances induced by isometric group actions. Such spaces arise naturally in applications where objects are considered equivalent up to transformations. We first establish general existence and consistency results for model-based Fréchet regression, with our quotient space regression model as a special case. As an important example we consider regression for elastic curves in the square-root velocity framework. This addresses data such as handwritten letters, movement paths, or outlines of objects, where only the image but not the parametrization of the curves is of interest. To handle sparsely or irregularly sampled curves, we model smooth conditional mean curves using splines. We validate our approach through simulations and an application to hippocampal outlines extracted from Magnetic Resonance Imaging scans. Here we model how the shape of the irregularly sampled hippocampus is related to age, Alzheimer’s disease and sex, to disentangle the shrinking effects of Alzheimer’s from normal aging.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"211 ","pages":"Article 105515"},"PeriodicalIF":1.4,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145321069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal estimation for a family of sparse covariance matrices with missing data 缺失数据稀疏协方差矩阵族的最优估计
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-10-06 DOI: 10.1016/j.jmva.2025.105513
Youming Liu , Li Miao
Estimation of covariance matrices plays an important role in high-dimensional inference problems. It has been investigated when some observations are missing. The known work usually assume the Gaussian or sub-Gaussian condition of a random vector. Cai and Zhang provide an optimal estimation for a class of sparse covariance matrices H under the sub-Gaussian assumption of a random vector, see T. T. Cai and A. Zhang, Journal of Multivariate Analysis, 2016. This current paper considers the same problem for a larger family of sparse covariance matrices Hɛ(0<ɛ2) under some weaker assumptions (not necessarily sub-Gaussian) of a random vector. When ɛ=2, our results generalize a theorem of Cai and Zhang. Numerical experiments are given to support our theoretical analysis.
协方差矩阵的估计在高维推理问题中起着重要的作用。在一些观测缺失的情况下,对它进行了调查。已知功通常假设随机向量的高斯或亚高斯条件。Cai和Zhang在随机向量的亚高斯假设下提供了一类稀疏协方差矩阵H的最优估计,见t.t. Cai和a . Zhang, Journal of Multivariate Analysis, 2016。本文在随机向量的一些较弱的假设(不一定是亚高斯)下,考虑了一个较大的稀疏协方差矩阵族H [0<;]≤2的相同问题。当k =2时,我们的结果推广了Cai和Zhang的一个定理。数值实验结果支持了理论分析。
{"title":"Optimal estimation for a family of sparse covariance matrices with missing data","authors":"Youming Liu ,&nbsp;Li Miao","doi":"10.1016/j.jmva.2025.105513","DOIUrl":"10.1016/j.jmva.2025.105513","url":null,"abstract":"<div><div>Estimation of covariance matrices plays an important role in high-dimensional inference problems. It has been investigated when some observations are missing. The known work usually assume the Gaussian or sub-Gaussian condition of a random vector. Cai and Zhang provide an optimal estimation for a class of sparse covariance matrices <span><math><mi>H</mi></math></span> under the sub-Gaussian assumption of a random vector, see T. T. Cai and A. Zhang, Journal of Multivariate Analysis, 2016. This current paper considers the same problem for a larger family of sparse covariance matrices <span><math><mrow><msub><mrow><mi>H</mi></mrow><mrow><mi>ɛ</mi></mrow></msub><mspace></mspace><mrow><mo>(</mo><mn>0</mn><mo>&lt;</mo><mi>ɛ</mi><mo>≤</mo><mn>2</mn><mo>)</mo></mrow></mrow></math></span> under some weaker assumptions (not necessarily sub-Gaussian) of a random vector. When <span><math><mrow><mi>ɛ</mi><mo>=</mo><mn>2</mn></mrow></math></span>, our results generalize a theorem of Cai and Zhang. Numerical experiments are given to support our theoretical analysis.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"211 ","pages":"Article 105513"},"PeriodicalIF":1.4,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145321070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Properties of CoVaR based on tail expansions of copulas 基于copula尾部展开的CoVaR性质
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-09-29 DOI: 10.1016/j.jmva.2025.105510
Xiaoting Li, Harry Joe
The theoretical properties of two widely used CoVaR definitions are investigated under different dependence structures in joint distributions. By using copulas, the dependence is separated from marginal distributions, and CoVaR is expressed through an adjustment factor based solely on the copula. The primary contribution is to study the limiting behavior of the adjustment factor and its link to the strength of dependence in the tails of the joint distribution. We also provide asymptotic results for bivariate Archimedean copulas and extend these findings to extreme value copulas and their mixtures. These findings enhance the understanding of CoVaR in risk scenarios, particularly as the conditional event becomes more extreme.
在联合分布的不同依赖结构下,研究了两种广泛使用的CoVaR定义的理论性质。通过使用copula,将相关性与边际分布分离开来,并通过仅基于copula的调整因子来表示CoVaR。主要贡献是研究了调整因子的极限行为及其与联合分布尾部依赖强度的联系。我们还提供了二元阿基米德copuls的渐近结果,并将这些发现推广到极值copuls及其混合物。这些发现加强了对风险情景中CoVaR的理解,特别是当条件事件变得更加极端时。
{"title":"Properties of CoVaR based on tail expansions of copulas","authors":"Xiaoting Li,&nbsp;Harry Joe","doi":"10.1016/j.jmva.2025.105510","DOIUrl":"10.1016/j.jmva.2025.105510","url":null,"abstract":"<div><div>The theoretical properties of two widely used CoVaR definitions are investigated under different dependence structures in joint distributions. By using copulas, the dependence is separated from marginal distributions, and CoVaR is expressed through an adjustment factor based solely on the copula. The primary contribution is to study the limiting behavior of the adjustment factor and its link to the strength of dependence in the tails of the joint distribution. We also provide asymptotic results for bivariate Archimedean copulas and extend these findings to extreme value copulas and their mixtures. These findings enhance the understanding of CoVaR in risk scenarios, particularly as the conditional event becomes more extreme.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"211 ","pages":"Article 105510"},"PeriodicalIF":1.4,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145266692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dimension selection in tensor decompositions and envelope models 张量分解和包络模型中的维数选择
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-09-25 DOI: 10.1016/j.jmva.2025.105512
Xin Zhang , Wenbiao Zhao , Lixing Zhu
The statistical analysis of tensor-valued data has emerged as an area of increasing methodological focus. Tensor decomposition models and low-rank tensor regression models often assume that there exists a low-rank structure of the tensor data or tensor coefficient. Consistent selection of structural dimensions or tensor ranks constitutes a problem of significant theoretical and practical importance. This paper introduces a unified framework for addressing this challenge, applicable across multiple tensor decomposition frameworks and envelope regression models.
张量值数据的统计分析已成为一个日益关注的方法论领域。张量分解模型和低秩张量回归模型通常假设张量数据或张量系数存在低秩结构。结构维数或张量秩的一致选择是一个具有重要理论和实践意义的问题。本文介绍了一个统一的框架来解决这一挑战,适用于多个张量分解框架和包络回归模型。
{"title":"Dimension selection in tensor decompositions and envelope models","authors":"Xin Zhang ,&nbsp;Wenbiao Zhao ,&nbsp;Lixing Zhu","doi":"10.1016/j.jmva.2025.105512","DOIUrl":"10.1016/j.jmva.2025.105512","url":null,"abstract":"<div><div>The statistical analysis of tensor-valued data has emerged as an area of increasing methodological focus. Tensor decomposition models and low-rank tensor regression models often assume that there exists a low-rank structure of the tensor data or tensor coefficient. Consistent selection of structural dimensions or tensor ranks constitutes a problem of significant theoretical and practical importance. This paper introduces a unified framework for addressing this challenge, applicable across multiple tensor decomposition frameworks and envelope regression models.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"211 ","pages":"Article 105512"},"PeriodicalIF":1.4,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145321071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical structure-guided high-dimensional multi-view clustering 层次结构引导的高维多视图聚类
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-09-24 DOI: 10.1016/j.jmva.2025.105488
Jiajia Jiang , Kuangnan Fang , Shuangge Ma , Qingzhao Zhang
Multi-view data clustering is pivotal for comprehending the heterogeneous structure of data by integrating information from diverse aspects. Nevertheless, practical challenges arise due to the differences in the granularity from different views, resulting in a hierarchical clustering structure within these distinct data types. In this work, we consider such structure information and propose a novel high-dimensional multi-view clustering approach with a hierarchical structure across views. The proposed non-convex problem is effectively tackled using the Alternating Direction Method of Multipliers algorithm, and we establish the statistical properties of the estimator. Simulation results demonstrate the effectiveness and superiority of our proposed method. In the analysis of the histopathological imaging data and gene expression data related to lung adenocarcinoma, our method unveils a hierarchical clustering structure that significantly diverges from alternative approaches.
多视图数据聚类是通过集成来自不同方面的信息来理解数据异构结构的关键。然而,由于来自不同视图的粒度不同,在这些不同的数据类型中产生了分层聚类结构,从而带来了实际的挑战。在这项工作中,我们考虑了这些结构信息,并提出了一种新颖的高维多视图聚类方法,该方法具有跨视图的分层结构。利用乘法器的交替方向法有效地解决了所提出的非凸问题,并建立了估计量的统计性质。仿真结果验证了该方法的有效性和优越性。在分析与肺腺癌相关的组织病理成像数据和基因表达数据时,我们的方法揭示了与其他方法明显不同的分层聚类结构。
{"title":"Hierarchical structure-guided high-dimensional multi-view clustering","authors":"Jiajia Jiang ,&nbsp;Kuangnan Fang ,&nbsp;Shuangge Ma ,&nbsp;Qingzhao Zhang","doi":"10.1016/j.jmva.2025.105488","DOIUrl":"10.1016/j.jmva.2025.105488","url":null,"abstract":"<div><div>Multi-view data clustering is pivotal for comprehending the heterogeneous structure of data by integrating information from diverse aspects. Nevertheless, practical challenges arise due to the differences in the granularity from different views, resulting in a hierarchical clustering structure within these distinct data types. In this work, we consider such structure information and propose a novel high-dimensional multi-view clustering approach with a hierarchical structure across views. The proposed non-convex problem is effectively tackled using the Alternating Direction Method of Multipliers algorithm, and we establish the statistical properties of the estimator. Simulation results demonstrate the effectiveness and superiority of our proposed method. In the analysis of the histopathological imaging data and gene expression data related to lung adenocarcinoma, our method unveils a hierarchical clustering structure that significantly diverges from alternative approaches.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"211 ","pages":"Article 105488"},"PeriodicalIF":1.4,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145155083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying differential networks through high-dimensional two-sample inference 通过高维双样本推理识别差分网络
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-09-24 DOI: 10.1016/j.jmva.2025.105511
Hui Chen , Yinxu Jia
In this article, we identify differential networks within the Gaussian graphical model framework by examining the equivalence of two precision matrices. It is challenging work when the dimension of the precision matrix increases with the sample size. Existing methods typically assume sparsity in the precision matrix structure, a condition often unmet in real data. To address this issue, we introduce a statistic based on debiased estimator of the high-dimensional precision matrix and employ multiplier bootstrap to approximate the null distribution of the proposed statistic. The proposed method can be easily coupled with various estimation algorithms for high-dimensional precision matrix. In comparison with existing methods, the superiority of the proposed approach lies in mild structure constraints to the unknown precision matrix, making it robust to intricate conditional dependence structures in real data. Additionally, we introduce a cross-fitting procedure that utilizes full data information, leading to enhanced statistical power. Theoretical justification is provided to ensure the validity of the proposed method without restrictive assumptions. We showcase the effectiveness of our proposed method by simulation and real data example, which provides evidence of the proposed method’s usefulness and potential for application in various domains.
在本文中,我们通过检查两个精度矩阵的等价性来识别高斯图形模型框架内的微分网络。当精度矩阵的尺寸随样本量的增加而增加时,这是一项具有挑战性的工作。现有方法通常在精确矩阵结构中假定稀疏性,而这一条件在实际数据中往往不满足。为了解决这个问题,我们引入了一种基于高维精度矩阵的去偏估计量的统计量,并使用乘法器自举来近似所提出的统计量的零分布。该方法可以方便地与各种高维精度矩阵的估计算法相结合。与现有方法相比,该方法的优点在于对未知精度矩阵的结构约束较轻,对实际数据中复杂的条件依赖结构具有较强的鲁棒性。此外,我们引入了一个交叉拟合程序,利用完整的数据信息,从而提高了统计能力。在没有限制性假设的情况下,为保证所提方法的有效性提供了理论依据。通过仿真和实际数据实例验证了所提方法的有效性,证明了所提方法在各个领域的实用性和应用潜力。
{"title":"Identifying differential networks through high-dimensional two-sample inference","authors":"Hui Chen ,&nbsp;Yinxu Jia","doi":"10.1016/j.jmva.2025.105511","DOIUrl":"10.1016/j.jmva.2025.105511","url":null,"abstract":"<div><div>In this article, we identify differential networks within the Gaussian graphical model framework by examining the equivalence of two precision matrices. It is challenging work when the dimension of the precision matrix increases with the sample size. Existing methods typically assume sparsity in the precision matrix structure, a condition often unmet in real data. To address this issue, we introduce a statistic based on debiased estimator of the high-dimensional precision matrix and employ multiplier bootstrap to approximate the null distribution of the proposed statistic. The proposed method can be easily coupled with various estimation algorithms for high-dimensional precision matrix. In comparison with existing methods, the superiority of the proposed approach lies in mild structure constraints to the unknown precision matrix, making it robust to intricate conditional dependence structures in real data. Additionally, we introduce a cross-fitting procedure that utilizes full data information, leading to enhanced statistical power. Theoretical justification is provided to ensure the validity of the proposed method without restrictive assumptions. We showcase the effectiveness of our proposed method by simulation and real data example, which provides evidence of the proposed method’s usefulness and potential for application in various domains.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"211 ","pages":"Article 105511"},"PeriodicalIF":1.4,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145266690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint graphical lasso with regularized aggregation 正则化聚合的联合图形套索
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-09-23 DOI: 10.1016/j.jmva.2025.105509
Jongik Chung , Qihu Zhang , Jennifer E. Mcdowell , Cheolwoo Park
We present methods for estimating multiple precision matrices for high-dimensional time series within the framework of Gaussian graphical models, with a specific focus on analyzing functional magnetic resonance imaging (fMRI) data collected from multiple subjects. Our goal is to estimate both individual brain networks and a collective structure representing a group of subjects. To achieve this, we propose a method that utilizes group Graphical Lasso and regularized aggregation to simultaneously estimate individual and group precision matrices, assigning varying weights to each individual based on their outlier status within the group. We investigate the convergence rates of precision matrix estimators under various norms and expectations, assessing their performance with sub-Gaussian and heavy-tailed data. The effectiveness of our methods is demonstrated through simulations and real fMRI data analysis.
我们提出了在高斯图形模型框架内估计高维时间序列的多个精度矩阵的方法,特别侧重于分析从多个受试者收集的功能磁共振成像(fMRI)数据。我们的目标是估计个体大脑网络和代表一组受试者的集体结构。为了实现这一点,我们提出了一种方法,利用群体图形Lasso和正则化聚合来同时估计个体和群体的精度矩阵,根据他们在群体中的异常状态为每个个体分配不同的权重。我们研究了精度矩阵估计器在不同范数和期望下的收敛速度,评估了它们在亚高斯和重尾数据下的性能。通过仿真和实际fMRI数据分析证明了我们方法的有效性。
{"title":"Joint graphical lasso with regularized aggregation","authors":"Jongik Chung ,&nbsp;Qihu Zhang ,&nbsp;Jennifer E. Mcdowell ,&nbsp;Cheolwoo Park","doi":"10.1016/j.jmva.2025.105509","DOIUrl":"10.1016/j.jmva.2025.105509","url":null,"abstract":"<div><div>We present methods for estimating multiple precision matrices for high-dimensional time series within the framework of Gaussian graphical models, with a specific focus on analyzing functional magnetic resonance imaging (fMRI) data collected from multiple subjects. Our goal is to estimate both individual brain networks and a collective structure representing a group of subjects. To achieve this, we propose a method that utilizes group Graphical Lasso and regularized aggregation to simultaneously estimate individual and group precision matrices, assigning varying weights to each individual based on their outlier status within the group. We investigate the convergence rates of precision matrix estimators under various norms and expectations, assessing their performance with sub-Gaussian and heavy-tailed data. The effectiveness of our methods is demonstrated through simulations and real fMRI data analysis.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"211 ","pages":"Article 105509"},"PeriodicalIF":1.4,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145266691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel martingale difference correlation via data splitting with applications in feature screening 一种新的基于数据分割的鞅差相关性及其在特征筛选中的应用
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-09-19 DOI: 10.1016/j.jmva.2025.105508
Zhengyu Zhu , Jicai Liu , Riquan Zhang
In this paper, we introduce a novel sample martingale difference correlation via data splitting to measure the departure of conditional mean independence between a response variable Y and a vector predictor X. The proposed correlation converges to zero and has an asymptotically symmetric sampling distribution around zero when Y and X are conditionally mean independent. In contrast, it converges to a positive value when Y and X are conditionally mean dependent. Leveraging these properties, we develop a new model-free feature screening method with false discovery rate (FDR) control for ultrahigh-dimensional data. We demonstrate that this screening method achieves FDR control and the sure screening property simultaneously. We also extend our approach to conditional quantile screening with FDR control. To further enhance the stability of the screening results, we implement multiple splitting techniques. We evaluate the finite sample performance of our proposed methods through simulations and real data analyses, and compare them with existing methods.
在本文中,我们通过数据分割引入了一种新的样本鞅差相关来度量响应变量Y和向量预测变量X之间的条件均值独立偏离,当Y和X是条件均值独立时,所提出的相关收敛于零,并且在零附近具有渐近对称的抽样分布。相反,当Y和X是条件平均相关时,它收敛到一个正值。利用这些特性,我们开发了一种新的无模型特征筛选方法,该方法具有超高维数据的错误发现率(FDR)控制。结果表明,该筛分方法既实现了FDR控制,又保证了筛分性能。我们还将我们的方法扩展到FDR控制的条件分位数筛选。为了进一步提高筛选结果的稳定性,我们采用了多重拆分技术。我们通过模拟和实际数据分析来评估我们提出的方法的有限样本性能,并将它们与现有方法进行比较。
{"title":"A novel martingale difference correlation via data splitting with applications in feature screening","authors":"Zhengyu Zhu ,&nbsp;Jicai Liu ,&nbsp;Riquan Zhang","doi":"10.1016/j.jmva.2025.105508","DOIUrl":"10.1016/j.jmva.2025.105508","url":null,"abstract":"<div><div>In this paper, we introduce a novel sample martingale difference correlation via data splitting to measure the departure of conditional mean independence between a response variable <span><math><mi>Y</mi></math></span> and a vector predictor <span><math><mi>X</mi></math></span>. The proposed correlation converges to zero and has an asymptotically symmetric sampling distribution around zero when <span><math><mi>Y</mi></math></span> and <span><math><mi>X</mi></math></span> are conditionally mean independent. In contrast, it converges to a positive value when <span><math><mi>Y</mi></math></span> and <span><math><mi>X</mi></math></span> are conditionally mean dependent. Leveraging these properties, we develop a new model-free feature screening method with false discovery rate (FDR) control for ultrahigh-dimensional data. We demonstrate that this screening method achieves FDR control and the sure screening property simultaneously. We also extend our approach to conditional quantile screening with FDR control. To further enhance the stability of the screening results, we implement multiple splitting techniques. We evaluate the finite sample performance of our proposed methods through simulations and real data analyses, and compare them with existing methods.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"211 ","pages":"Article 105508"},"PeriodicalIF":1.4,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145096561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On nonparametric functional data regression with incomplete observations 不完全观测值下的非参数函数数据回归
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-09-17 DOI: 10.1016/j.jmva.2025.105497
Majid Mojirsheibani
In this work we consider the problem of nonparametric estimation of a regression function m(χ)=E(Y|χ=χ) with the functional covariate χ when the response Y may be missing according to a missing-not-at-random (MNAR) setup, i.e., when the underlying missing probability mechanism can depend on both χ and Y. Our proposed estimator is based on a particular representation of the regression function m(χ) in terms of four associated conditional expectations that can be estimated nonparametrically. To assess the theoretical performance of our estimators, we study their convergence properties in general Lp norms where we also look into their rates of convergence. Our numerical results show that the proposed estimators have good finite-sample performance. We also explore the applications of our results to the problem of statistical classification with missing labels and establish a number of convergence results for new kernel-type classification rules.
在这项工作中,我们考虑了回归函数m(χ)=E(Y|χ=χ)与函数协变量χ的非参数估计问题,当响应Y根据缺失非随机(MNAR)设置可能缺失时,即,当潜在的缺失概率机制可以依赖于χ和Y时。我们提出的估计器基于回归函数m(χ)的特定表示,其中包含四个可以非参数估计的相关条件期望。为了评估我们的估计器的理论性能,我们研究了它们在一般Lp范数下的收敛性质,并研究了它们的收敛速率。数值结果表明,所提估计器具有良好的有限样本性能。我们还探索了我们的结果在缺少标签的统计分类问题上的应用,并建立了一些新的核类型分类规则的收敛结果。
{"title":"On nonparametric functional data regression with incomplete observations","authors":"Majid Mojirsheibani","doi":"10.1016/j.jmva.2025.105497","DOIUrl":"10.1016/j.jmva.2025.105497","url":null,"abstract":"<div><div>In this work we consider the problem of nonparametric estimation of a regression function <span><math><mrow><mi>m</mi><mrow><mo>(</mo><mi>χ</mi><mo>)</mo></mrow><mo>=</mo><mi>E</mi><mrow><mo>(</mo><mi>Y</mi><mo>|</mo><mspace></mspace><mi>χ</mi><mo>=</mo><mi>χ</mi><mo>)</mo></mrow></mrow></math></span> with the functional covariate <span><math><mrow><mi>χ</mi></mrow></math></span> when the response <span><math><mi>Y</mi></math></span> may be missing according to a missing-not-at-random (MNAR) setup, i.e., when the underlying missing probability mechanism can depend on both <span><math><mrow><mi>χ</mi></mrow></math></span> and <span><math><mi>Y</mi></math></span>. Our proposed estimator is based on a particular representation of the regression function <span><math><mrow><mi>m</mi><mrow><mo>(</mo><mi>χ</mi><mo>)</mo></mrow></mrow></math></span> in terms of four associated conditional expectations that can be estimated nonparametrically. To assess the theoretical performance of our estimators, we study their convergence properties in general <span><math><msup><mrow><mi>L</mi></mrow><mrow><mi>p</mi></mrow></msup></math></span> norms where we also look into their rates of convergence. Our numerical results show that the proposed estimators have good finite-sample performance. We also explore the applications of our results to the problem of statistical classification with missing labels and establish a number of convergence results for new kernel-type classification rules.</div></div>","PeriodicalId":16431,"journal":{"name":"Journal of Multivariate Analysis","volume":"211 ","pages":"Article 105497"},"PeriodicalIF":1.4,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145096559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Multivariate Analysis
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1