首页 > 最新文献

Test最新文献

英文 中文
Conformal link prediction for false discovery rate control 控制误发现率的共形链路预测
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-06-11 DOI: 10.1007/s11749-024-00934-w
Ariane Marandon

Most link prediction methods return estimates of the connection probability of missing edges in a graph. Such output can be used to rank the missing edges from most to least likely to be a true edge, but does not directly provide a classification into true and nonexistent. In this work, we consider the problem of identifying a set of true edges with a control of the false discovery rate (FDR). We propose a novel method based on high-level ideas from the literature on conformal inference. The graph structure induces intricate dependence in the data, which we carefully take into account, as this makes the setup different from the usual setup in conformal inference, where data exchangeability is assumed. The FDR control is empirically demonstrated for both simulated and real data.

大多数链接预测方法都会返回图中缺失边的连接概率估计值。这种输出结果可用于将缺失边缘从最有可能成为真边缘到最不可能成为真边缘进行排序,但不能直接提供真边缘和不存在边缘的分类。在这项工作中,我们考虑的问题是在控制错误发现率 (FDR) 的情况下识别一组真边缘。我们提出了一种基于共形推理文献中高层次思想的新方法。图结构引起了数据中错综复杂的依赖性,我们仔细考虑了这一点,因为这使得设置不同于保角推理中的通常设置,在保角推理中,数据交换性是假定的。我们对模拟数据和真实数据的 FDR 控制进行了实证验证。
{"title":"Conformal link prediction for false discovery rate control","authors":"Ariane Marandon","doi":"10.1007/s11749-024-00934-w","DOIUrl":"https://doi.org/10.1007/s11749-024-00934-w","url":null,"abstract":"<p>Most link prediction methods return estimates of the connection probability of missing edges in a graph. Such output can be used to rank the missing edges from most to least likely to be a true edge, but does not directly provide a classification into true and nonexistent. In this work, we consider the problem of identifying a set of true edges with a control of the false discovery rate (FDR). We propose a novel method based on high-level ideas from the literature on conformal inference. The graph structure induces intricate dependence in the data, which we carefully take into account, as this makes the setup different from the usual setup in conformal inference, where data exchangeability is assumed. The FDR control is empirically demonstrated for both simulated and real data.\u0000</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"132 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141509526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Partly linear instrumental variables regressions without smoothing on the instruments 不对工具进行平滑处理的部分线性工具变量回归
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-05-30 DOI: 10.1007/s11749-024-00931-z
Jean-Pierre Florens, Elia Lapenta

We consider a semiparametric partly linear model identified by instrumental variables. We propose an estimation method that does not smooth on the instruments and we extend the Landweber–Fridman regularization scheme to the estimation of this semiparametric model. We then show the asymptotic normality of the parametric estimator and obtain the convergence rate for the nonparametric estimator. Our estimator that does not smooth on the instruments coincides with a typical estimator that does smooth on the instruments but keeps the respective bandwidth fixed as the sample size increases. We propose a data driven method for the selection of the regularization parameter, and in a simulation study we show the attractive performance of our estimators.

我们考虑了一个由工具变量确定的半参数部分线性模型。我们提出了一种不依赖工具的估计方法,并将 Landweber-Fridman 正则化方案扩展到该半参数模型的估计中。然后,我们展示了参数估计器的渐近正态性,并获得了非参数估计器的收敛率。我们不对工具进行平滑的估计器与对工具进行平滑的典型估计器不谋而合,后者会随着样本量的增加而保持各自的带宽固定不变。我们提出了一种数据驱动的正则化参数选择方法,并在模拟研究中展示了我们的估计器极具吸引力的性能。
{"title":"Partly linear instrumental variables regressions without smoothing on the instruments","authors":"Jean-Pierre Florens, Elia Lapenta","doi":"10.1007/s11749-024-00931-z","DOIUrl":"https://doi.org/10.1007/s11749-024-00931-z","url":null,"abstract":"<p>We consider a semiparametric partly linear model identified by instrumental variables. We propose an estimation method that does not smooth on the instruments and we extend the Landweber–Fridman regularization scheme to the estimation of this semiparametric model. We then show the asymptotic normality of the parametric estimator and obtain the convergence rate for the nonparametric estimator. Our estimator that does not smooth on the instruments coincides with a typical estimator that does smooth on the instruments but keeps the respective bandwidth fixed as the sample size increases. We propose a data driven method for the selection of the regularization parameter, and in a simulation study we show the attractive performance of our estimators.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"64 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141197455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A new sufficient dimension reduction method via rank divergence 通过秩发散实现充分降维的新方法
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-05-30 DOI: 10.1007/s11749-024-00929-7
Tianqing Liu, Danning Li, Fengjiao Ren, Jianguo Sun, Xiaohui Yuan

Sufficient dimension reduction is commonly performed to achieve data reduction and help data visualization. Its main goal is to identify functions of the predictors that are smaller in number than the predictors and contain the same information as the predictors for the response. In this paper, we are concerned with the linear functions of the predictors, which determine a central subspace that preserves sufficient information about the conditional distribution of a response given covariates. Many methods have been developed in the literature for the estimation of the central subspace. However, most of the existing sufficient dimension reduction methods are sensitive to outliers and require some strict restrictions on both covariates and response. To address this, we propose a novel dependence measure, rank divergence, and develop a rank divergence-based sufficient dimension reduction approach. The new method only requires some mild conditions on the covariates and response and is robust to outliers or heavy-tailed distributions. Moreover, it applies to both discrete or categorical covariates and multivariate responses. The consistency of the resulting estimator of the central subspace is established, and numerical studies suggest that it works well in practical situations.

充分降维通常用于实现数据缩减和帮助数据可视化。其主要目标是确定预测因子的函数,这些函数的数量少于预测因子,且包含与响应预测因子相同的信息。在本文中,我们关注的是预测因子的线性函数,它确定了一个中心子空间,该空间保留了给定协变量时响应条件分布的足够信息。文献中提出了许多估计中心子空间的方法。然而,大多数现有的充分降维方法对异常值都很敏感,并且对协变量和响应都有严格的限制。为此,我们提出了一种新的依赖性度量--秩发散,并开发了一种基于秩发散的充分降维方法。这种新方法只需要对协变量和响应设定一些温和的条件,并且对异常值或重尾分布具有鲁棒性。此外,它还适用于离散或分类协变量和多变量响应。由此得到的中心子空间估计值的一致性已得到确定,数值研究表明它在实际情况下运行良好。
{"title":"A new sufficient dimension reduction method via rank divergence","authors":"Tianqing Liu, Danning Li, Fengjiao Ren, Jianguo Sun, Xiaohui Yuan","doi":"10.1007/s11749-024-00929-7","DOIUrl":"https://doi.org/10.1007/s11749-024-00929-7","url":null,"abstract":"<p>Sufficient dimension reduction is commonly performed to achieve data reduction and help data visualization. Its main goal is to identify functions of the predictors that are smaller in number than the predictors and contain the same information as the predictors for the response. In this paper, we are concerned with the linear functions of the predictors, which determine a central subspace that preserves sufficient information about the conditional distribution of a response given covariates. Many methods have been developed in the literature for the estimation of the central subspace. However, most of the existing sufficient dimension reduction methods are sensitive to outliers and require some strict restrictions on both covariates and response. To address this, we propose a novel dependence measure, rank divergence, and develop a rank divergence-based sufficient dimension reduction approach. The new method only requires some mild conditions on the covariates and response and is robust to outliers or heavy-tailed distributions. Moreover, it applies to both discrete or categorical covariates and multivariate responses. The consistency of the resulting estimator of the central subspace is established, and numerical studies suggest that it works well in practical situations.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"72 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141197326","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Asymptotic results for nonparametric regression estimators after sufficient dimension reduction estimation 充分降维估计后非参数回归估计器的渐近结果
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-05-28 DOI: 10.1007/s11749-024-00932-y
Liliana Forzani, Daniela Rodriguez, Mariela Sued

Prediction, in regression and classification, is one of the main aims in modern data science. When the number of predictors is large, a common first step is to reduce the dimension of the data. Sufficient dimension reduction (SDR) is a well-established paradigm of reduction that keeps all the relevant information in the covariates X that is necessary for the prediction of Y. In practice, SDR has been successfully used as an exploratory tool for modeling after estimation of the sufficient reduction. Nevertheless, even if the estimated reduction is a consistent estimator of the population, there is no theory supporting this step when nonparametric regression is used in the imputed estimator. In this paper, we show that the asymptotic distribution of the nonparametric regression estimator remains unchanged whether the true SDR or its estimator is used. This result allows making inferences, for example, computing confidence intervals for the regression function, thereby avoiding the curse of dimensionality.

回归和分类中的预测是现代数据科学的主要目标之一。当预测因子数量较多时,第一步通常是降低数据维度。充分降维(SDR)是一种行之有效的降维范式,它能保留协变量 X 中对预测 Y 必不可少的所有相关信息。然而,即使估计出的还原是一个一致的总体估计器,但当在估算估计器中使用非参数回归时,并没有理论支持这一步骤。在本文中,我们证明了无论使用真实 SDR 还是其估计值,非参数回归估计值的渐近分布都保持不变。这一结果允许进行推论,例如计算回归函数的置信区间,从而避免了维度诅咒。
{"title":"Asymptotic results for nonparametric regression estimators after sufficient dimension reduction estimation","authors":"Liliana Forzani, Daniela Rodriguez, Mariela Sued","doi":"10.1007/s11749-024-00932-y","DOIUrl":"https://doi.org/10.1007/s11749-024-00932-y","url":null,"abstract":"<p>Prediction, in regression and classification, is one of the main aims in modern data science. When the number of predictors is large, a common first step is to reduce the dimension of the data. Sufficient dimension reduction (SDR) is a well-established paradigm of reduction that keeps all the relevant information in the covariates <i>X</i> that is necessary for the prediction of <i>Y</i>. In practice, SDR has been successfully used as an exploratory tool for modeling after estimation of the sufficient reduction. Nevertheless, even if the estimated reduction is a consistent estimator of the population, there is no theory supporting this step when nonparametric regression is used in the imputed estimator. In this paper, we show that the asymptotic distribution of the nonparametric regression estimator remains unchanged whether the true SDR or its estimator is used. This result allows making inferences, for example, computing confidence intervals for the regression function, thereby avoiding the curse of dimensionality.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"70 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141173537","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Privacy-preserving parametric inference for spatial autoregressive model 空间自回归模型的隐私保护参数推理
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-09 DOI: 10.1007/s11749-024-00928-8
Zhijian Wang, Yunquan Song

Spatial regression models are important tools in dealing with spatially dependent data and are widely used in many fields such as spatial econometric and regional science. When the spatial data contain sensitive information, the privacy of the data will be compromised along with the release of the analysis if appropriate privacy-preserving measures are not in place. In this paper, we study the privacy-preserving parametric inference for the spatial autoregressive model and propose corresponding differentially private algorithm. We construct a differentially private spatial autoregression framework that takes graph data into account. We improve the functional mechanism to be more accurate under the same degree of privacy protection. Theoretical analysis establishes both the privacy guarantees of the algorithm and the asymptotic normality of the estimation. Simulation and real data studies show improvements of our approach.

空间回归模型是处理空间相关数据的重要工具,广泛应用于空间计量经济学和区域科学等多个领域。当空间数据包含敏感信息时,如果不采取适当的隐私保护措施,数据的隐私将受到损害,同时分析结果也会被泄露。本文研究了空间自回归模型的隐私保护参数推断,并提出了相应的差异化隐私算法。我们构建了一个考虑到图形数据的差异化私有空间自回归框架。我们改进了函数机制,使其在同等程度的隐私保护下更加精确。理论分析确定了算法的隐私保证和估计的渐近正态性。仿真和真实数据研究显示了我们方法的改进。
{"title":"Privacy-preserving parametric inference for spatial autoregressive model","authors":"Zhijian Wang, Yunquan Song","doi":"10.1007/s11749-024-00928-8","DOIUrl":"https://doi.org/10.1007/s11749-024-00928-8","url":null,"abstract":"<p>Spatial regression models are important tools in dealing with spatially dependent data and are widely used in many fields such as spatial econometric and regional science. When the spatial data contain sensitive information, the privacy of the data will be compromised along with the release of the analysis if appropriate privacy-preserving measures are not in place. In this paper, we study the privacy-preserving parametric inference for the spatial autoregressive model and propose corresponding differentially private algorithm. We construct a differentially private spatial autoregression framework that takes graph data into account. We improve the functional mechanism to be more accurate under the same degree of privacy protection. Theoretical analysis establishes both the privacy guarantees of the algorithm and the asymptotic normality of the estimation. Simulation and real data studies show improvements of our approach.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"26 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140586664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiple change point detection for high-dimensional data 高维数据的多变化点检测
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-03-25 DOI: 10.1007/s11749-024-00926-w
Wenbiao Zhao, Lixing Zhu, Falong Tan

This research investigates the detection of multiple change points in high-dimensional data without particular sparse or dense structure, where the dimension can be of exponential order in relation to the sample size. The estimation approach proposed employs a signal statistic based on a sequence of signal screening-based local U-statistics. This technique avoids costly computations that exhaustive search algorithms require and mitigates false positives, which hypothesis testing-based methods need to control. Consistency of estimation can be achieved for both the locations and number of change points, even when the number of change points diverges at a certain rate as the sample size increases. Additionally, the visualization nature of the proposed approach makes plotting the signal statistic a useful tool to identify locations of change points, which distinguishes it from existing methods in the literature. Numerical studies are performed to evaluate the effectiveness of the proposed technique in finite sample scenarios, and a real data analysis is presented to illustrate its application.

本研究探讨了在无特定稀疏或密集结构的高维数据中检测多个变化点的问题,其中维数可能是与样本大小相关的指数阶。所提出的估计方法采用了基于信号筛选的局部 U 统计序列的信号统计。这种技术避免了穷举搜索算法所需的昂贵计算,并减少了基于假设检验的方法需要控制的假阳性。即使随着样本量的增加,变化点的数量以一定的速度发生变化,也能实现对变化点位置和数量的一致性估计。此外,所提方法的可视化特性使绘制信号统计图成为识别变化点位置的有用工具,这使其有别于文献中的现有方法。我们进行了数值研究,以评估拟议技术在有限样本情况下的有效性,并通过实际数据分析来说明其应用。
{"title":"Multiple change point detection for high-dimensional data","authors":"Wenbiao Zhao, Lixing Zhu, Falong Tan","doi":"10.1007/s11749-024-00926-w","DOIUrl":"https://doi.org/10.1007/s11749-024-00926-w","url":null,"abstract":"<p>This research investigates the detection of multiple change points in high-dimensional data without particular sparse or dense structure, where the dimension can be of exponential order in relation to the sample size. The estimation approach proposed employs a signal statistic based on a sequence of signal screening-based local U-statistics. This technique avoids costly computations that exhaustive search algorithms require and mitigates false positives, which hypothesis testing-based methods need to control. Consistency of estimation can be achieved for both the locations and number of change points, even when the number of change points diverges at a certain rate as the sample size increases. Additionally, the visualization nature of the proposed approach makes plotting the signal statistic a useful tool to identify locations of change points, which distinguishes it from existing methods in the literature. Numerical studies are performed to evaluate the effectiveness of the proposed technique in finite sample scenarios, and a real data analysis is presented to illustrate its application.\u0000</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"19 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140297606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rejoinder on: Shape-based functional data analysis 关于:基于形状的功能数据分析
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-03-18 DOI: 10.1007/s11749-024-00925-x
Yuexuan Wu, Chao Huang, Anuj Srivastava

We express our gratitude to the authors of five comment articles for their valuable contributions, feedback, and recommendations on our discussion document (Wu et al. Test, 2023). All the reviewers acknowledged the value of our proposed research direction, which focuses on shape-based functional data analysis. They also provided insightful suggestions to enhance and expand upon these ideas. In this response, we address their comments and provide further insights.

我们对五篇评论文章的作者表示感谢,感谢他们对我们的讨论文件(Wu 等人 Test, 2023)所做的宝贵贡献、反馈和建议。所有审稿人都肯定了我们提出的研究方向的价值,该方向侧重于基于形状的功能数据分析。他们还就如何加强和扩展这些想法提出了有见地的建议。在本回复中,我们将回应他们的意见并提供进一步的见解。
{"title":"Rejoinder on: Shape-based functional data analysis","authors":"Yuexuan Wu, Chao Huang, Anuj Srivastava","doi":"10.1007/s11749-024-00925-x","DOIUrl":"https://doi.org/10.1007/s11749-024-00925-x","url":null,"abstract":"<p>We express our gratitude to the authors of five comment articles for their valuable contributions, feedback, and recommendations on our discussion document (Wu et al. Test, 2023). All the reviewers acknowledged the value of our proposed research direction, which focuses on shape-based functional data analysis. They also provided insightful suggestions to enhance and expand upon these ideas. In this response, we address their comments and provide further insights.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"22 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140166224","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data integration via analysis of subspaces (DIVAS) 通过子空间分析进行数据整合(DIVAS)
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-03-14 DOI: 10.1007/s11749-024-00923-z
Jack Prothero, Meilei Jiang, Jan Hannig, Quoc Tran-Dinh, Andrew Ackerman, J. S. Marron

Modern data collection in many data paradigms, including bioinformatics, often incorporates multiple traits derived from different data types (i.e., platforms). We call this data multi-block, multi-view, or multi-omics data. The emergent field of data integration develops and applies new methods for studying multi-block data and identifying how different data types relate and differ. One major frontier in contemporary data integration research is methodology that can identify partially shared structure between sub-collections of data types. This work presents a new approach: Data Integration Via Analysis of Subspaces (DIVAS). DIVAS combines new insights in angular subspace perturbation theory with recent developments in matrix signal processing and convex–concave optimization into one algorithm for exploring partially shared structure. Based on principal angles between subspaces, DIVAS provides built-in inference on the results of the analysis, and is effective even in high-dimension-low-sample-size (HDLSS) situations.

在包括生物信息学在内的许多数据范式中,现代数据收集通常包含来自不同数据类型(即平台)的多种特征。我们称这种数据为多块、多视角或多组学数据。新兴的数据整合领域开发并应用了新方法来研究多块数据,并确定不同数据类型之间的关系和差异。当代数据整合研究的一个主要前沿领域是能够识别数据类型子集合之间部分共享结构的方法。这项工作提出了一种新方法:通过子空间分析进行数据整合(DIVAS)。DIVAS 将角度子空间扰动理论的新见解与矩阵信号处理和凸凹优化的最新发展相结合,成为一种探索部分共享结构的算法。DIVAS 基于子空间之间的主角,可对分析结果进行内置推理,即使在高维度-低样本量(HDLSS)的情况下也很有效。
{"title":"Data integration via analysis of subspaces (DIVAS)","authors":"Jack Prothero, Meilei Jiang, Jan Hannig, Quoc Tran-Dinh, Andrew Ackerman, J. S. Marron","doi":"10.1007/s11749-024-00923-z","DOIUrl":"https://doi.org/10.1007/s11749-024-00923-z","url":null,"abstract":"<p>Modern data collection in many data paradigms, including bioinformatics, often incorporates multiple traits derived from different data types (i.e., platforms). We call this data multi-block, multi-view, or multi-omics data. The emergent field of data integration develops and applies new methods for studying multi-block data and identifying how different data types relate and differ. One major frontier in contemporary data integration research is methodology that can identify partially shared structure between sub-collections of data types. This work presents a new approach: Data Integration Via Analysis of Subspaces (DIVAS). DIVAS combines new insights in angular subspace perturbation theory with recent developments in matrix signal processing and convex–concave optimization into one algorithm for exploring partially shared structure. Based on principal angles between subspaces, DIVAS provides built-in inference on the results of the analysis, and is effective even in high-dimension-low-sample-size (HDLSS) situations.</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"31 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140152205","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing covariance structures belonging to a quadratic subspace under a doubly multivariate model 在双多元模型下测试属于二次子空间的协方差结构
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-02-28 DOI: 10.1007/s11749-024-00922-0

Abstract

A hypothesis related to the block structure of a covariance matrix under the doubly multivariate normal model is studied. It is assumed that the block structure of the covariance matrix belongs to a quadratic subspace, and under the null hypothesis, each block of the covariance matrix also has a structure belonging to some quadratic subspace. The Rao score and the likelihood ratio test statistics are derived, and the exact distribution of the likelihood ratio test is determined. Simulation studies show the advantage of the Rao score test over the likelihood ratio test in terms of speed of convergence to the limiting chi-square distribution, while both proposed tests are competitive in terms of their power. The results are applied to both simulated and real-life example data.

摘要 研究了与双多元正态模型下协方差矩阵的块结构有关的假设。假设协方差矩阵的块结构属于二次子空间,在零假设下,协方差矩阵的每个块也具有属于某个二次子空间的结构。得出了 Rao 分数和似然比检验统计量,并确定了似然比检验的精确分布。模拟研究表明,就收敛到极限奇平方分布的速度而言,拉奥分数检验比似然比检验更有优势,而所提出的两种检验在功率方面都具有竞争力。研究结果同时应用于模拟数据和现实示例数据。
{"title":"Testing covariance structures belonging to a quadratic subspace under a doubly multivariate model","authors":"","doi":"10.1007/s11749-024-00922-0","DOIUrl":"https://doi.org/10.1007/s11749-024-00922-0","url":null,"abstract":"<h3>Abstract</h3> <p>A hypothesis related to the block structure of a covariance matrix under the doubly multivariate normal model is studied. It is assumed that the block structure of the covariance matrix belongs to a quadratic subspace, and under the null hypothesis, each block of the covariance matrix also has a structure belonging to some quadratic subspace. The Rao score and the likelihood ratio test statistics are derived, and the exact distribution of the likelihood ratio test is determined. Simulation studies show the advantage of the Rao score test over the likelihood ratio test in terms of speed of convergence to the limiting chi-square distribution, while both proposed tests are competitive in terms of their power. The results are applied to both simulated and real-life example data. </p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"8 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140001677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The orthogonal skew model: computationally efficient multivariate skew-normal and skew-t distributions with applications to model-based clustering 正交偏斜模型:计算效率高的多元偏斜正态分布和偏斜-t 分布及其在基于模型的聚类中的应用
IF 1.3 4区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-02-26 DOI: 10.1007/s11749-024-00920-2
Ryan P. Browne, Jeffrey L. Andrews

We introduce a parameterization for the multivariate skew normal and skew-t distributions, which enforces an orthogonal structure on the skewness parameter. This approach provides substantial benefits in computational efficiency during parameter estimation, resulting in a model which strikes an excellent balance between flexibility and model-fitting feasibility. We illustrate this primarily through implementing the proposed distributions in a mixture model-based clustering framework. We compare to competing skew distributions via both simulated and real data analyses, reporting both computation time and model-fit metrics.

我们为多元偏态正态分布和偏态-t 分布引入了一种参数化方法,该方法对偏度参数强制采用正交结构。这种方法大大提高了参数估计过程中的计算效率,使模型在灵活性和模型拟合可行性之间达到了极佳的平衡。我们主要通过在基于混合模型的聚类框架中实施所提出的分布来说明这一点。我们通过模拟和真实数据分析,将其与竞争性偏斜分布进行了比较,并报告了计算时间和模型拟合指标。
{"title":"The orthogonal skew model: computationally efficient multivariate skew-normal and skew-t distributions with applications to model-based clustering","authors":"Ryan P. Browne, Jeffrey L. Andrews","doi":"10.1007/s11749-024-00920-2","DOIUrl":"https://doi.org/10.1007/s11749-024-00920-2","url":null,"abstract":"<p>We introduce a parameterization for the multivariate skew normal and skew-<i>t</i> distributions, which enforces an orthogonal structure on the skewness parameter. This approach provides substantial benefits in computational efficiency during parameter estimation, resulting in a model which strikes an excellent balance between flexibility and model-fitting feasibility. We illustrate this primarily through implementing the proposed distributions in a mixture model-based clustering framework. We compare to competing skew distributions via both simulated and real data analyses, reporting both computation time and model-fit metrics.\u0000</p>","PeriodicalId":51189,"journal":{"name":"Test","volume":"42 1","pages":""},"PeriodicalIF":1.3,"publicationDate":"2024-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139968590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Test
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1