首页 > 最新文献

Statistica Sinica最新文献

英文 中文
STATISTICAL INFERENCE FOR MEAN FUNCTIONS OF COMPLEX 3D OBJECTS. 复杂三维物体平均函数的统计推断。
IF 1.2 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-07-01 DOI: 10.5705/ss.202023.0071
Yueying Wang, Guannan Wang, Brandon Klinedinst, Auriel Willette, Li Wang

The use of complex three-dimensional (3D) objects is growing in various applications as data collection techniques continue to evolve. Identifying and locating significant effects within these objects is essential for making informed decisions based on the data. This article presents an advanced nonparametric method for learning and inferring complex 3D objects, enabling accurate estimation of the underlying signals and efficient detection and localization of significant effects. The proposed method addresses the problem of analyzing irregular-shaped 3D objects by modeling them as functional data and utilizing trivariate spline smoothing based on triangulations to estimate the underlying signals. We develop a highly efficient procedure that accurately estimates the mean and covariance functions, as well as the eigenvalues and eigenfunctions. Furthermore, we rigorously establish the asymptotic properties of these estimators. Additionally, a novel approach for constructing simultaneous confidence corridors to quantify estimation uncertainty is presented, and the procedure is extended to accommodate comparisons between two independent samples. The finite-sample performance of the proposed methods is illustrated through numerical experiments and a real-data application using the Alzheimer's Disease Neuroimaging Initiative database.

随着数据收集技术的不断发展,复杂三维(3D)对象的使用在各种应用中不断增长。识别和定位这些对象中的重大影响对于根据数据做出明智的决策至关重要。本文提出了一种先进的非参数方法来学习和推断复杂的三维物体,能够准确估计潜在的信号,并有效地检测和定位重要的影响。该方法通过将不规则形状的三维物体建模为功能数据,并利用基于三角剖分的三元样条平滑来估计底层信号,从而解决了分析不规则形状三维物体的问题。我们开发了一个高效的程序,可以准确地估计均值和协方差函数,以及特征值和特征函数。进一步,我们严格地建立了这些估计量的渐近性质。此外,提出了一种构建同步置信走廊的新方法来量化估计不确定性,并扩展了该过程以适应两个独立样本之间的比较。通过数值实验和使用阿尔茨海默病神经成像倡议数据库的实际数据应用,说明了所提出方法的有限样本性能。
{"title":"STATISTICAL INFERENCE FOR MEAN FUNCTIONS OF COMPLEX 3D OBJECTS.","authors":"Yueying Wang, Guannan Wang, Brandon Klinedinst, Auriel Willette, Li Wang","doi":"10.5705/ss.202023.0071","DOIUrl":"10.5705/ss.202023.0071","url":null,"abstract":"<p><p>The use of complex three-dimensional (3D) objects is growing in various applications as data collection techniques continue to evolve. Identifying and locating significant effects within these objects is essential for making informed decisions based on the data. This article presents an advanced nonparametric method for learning and inferring complex 3D objects, enabling accurate estimation of the underlying signals and efficient detection and localization of significant effects. The proposed method addresses the problem of analyzing irregular-shaped 3D objects by modeling them as functional data and utilizing trivariate spline smoothing based on triangulations to estimate the underlying signals. We develop a highly efficient procedure that accurately estimates the mean and covariance functions, as well as the eigenvalues and eigenfunctions. Furthermore, we rigorously establish the asymptotic properties of these estimators. Additionally, a novel approach for constructing simultaneous confidence corridors to quantify estimation uncertainty is presented, and the procedure is extended to accommodate comparisons between two independent samples. The finite-sample performance of the proposed methods is illustrated through numerical experiments and a real-data application using the Alzheimer's Disease Neuroimaging Initiative database.</p>","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":"1451-1477"},"PeriodicalIF":1.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12419769/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70939966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-dimensional Subgroup Regression Analysis. 高维亚群回归分析。
IF 1.2 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-07-01 DOI: 10.5705/ss.202023.0075
Fei Jiang, Lu Tian, Jian Kang, Lexin Li

Classical regression generally assumes that all subjects follow a common model with the same set of parameters. With ever advancing capabilities of modern technologies to collect more subjects and more covariates, it has become increasingly common that there exist subgroups of subjects, and each group follows a different regression model with a different set of parameters. In this article, we propose a new approach for subgroup analysis in regression modeling. Specifically, we model the relation between a response and a set of primary predictors, while we explicitly model the heterogenous association given another set of auxiliary predictors, through the interaction between the primary and auxiliary variables. We introduce penalties to induce the sparsity and group structures within the regression coefficients, and to achieve simultaneous feature selection for both primary predictors that are significantly associated with the response, as well as the auxiliary predictors that define the subgroups. We establish the asymptotic guarantees in terms of parameter estimation consistency and cluster estimation consistency. We illustrate our method with an analysis of the functional magnetic resonance imaging data from the Adolescent Brain Cognitive Development Study.

经典回归通常假设所有受试者遵循具有相同参数集的共同模型。随着现代技术的不断进步,可以收集更多的主题和更多的协变量,存在主题的子组已经变得越来越普遍,每个子组遵循不同的回归模型和不同的参数集。本文提出了回归模型中子群分析的一种新方法。具体来说,我们对响应和一组主要预测因子之间的关系进行了建模,同时通过主要变量和辅助变量之间的相互作用,明确地对另一组辅助预测因子的异质关联进行了建模。我们引入惩罚来诱导回归系数中的稀疏性和组结构,并同时实现与响应显著相关的主要预测因子以及定义子组的辅助预测因子的特征选择。建立了参数估计一致性和聚类估计一致性的渐近保证。我们通过分析青少年大脑认知发展研究的功能磁共振成像数据来说明我们的方法。
{"title":"High-dimensional Subgroup Regression Analysis.","authors":"Fei Jiang, Lu Tian, Jian Kang, Lexin Li","doi":"10.5705/ss.202023.0075","DOIUrl":"10.5705/ss.202023.0075","url":null,"abstract":"<p><p>Classical regression generally assumes that all subjects follow a common model with the same set of parameters. With ever advancing capabilities of modern technologies to collect more subjects and more covariates, it has become increasingly common that there exist subgroups of subjects, and each group follows a different regression model with a different set of parameters. In this article, we propose a new approach for subgroup analysis in regression modeling. Specifically, we model the relation between a response and a set of primary predictors, while we explicitly model the heterogenous association given another set of auxiliary predictors, through the interaction between the primary and auxiliary variables. We introduce penalties to induce the sparsity and group structures within the regression coefficients, and to achieve simultaneous feature selection for both primary predictors that are significantly associated with the response, as well as the auxiliary predictors that define the subgroups. We establish the asymptotic guarantees in terms of parameter estimation consistency and cluster estimation consistency. We illustrate our method with an analysis of the functional magnetic resonance imaging data from the Adolescent Brain Cognitive Development Study.</p>","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"35 3","pages":"1713-1736"},"PeriodicalIF":1.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12344502/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144849473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A New Paradigm for Generative Adversarial Networks based on Randomized Decision Rules. 基于随机决策规则的生成对抗网络新范式。
IF 1.5 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2025-04-01 DOI: 10.5705/ss.202022.0404
Sehwan Kim, Qifan Song, Faming Liang

The Generative Adversarial Network (GAN) was recently introduced in the literature as a novel machine learning method for training generative models. It has many applications in statistics such as nonparametric clustering and nonparametric conditional independence tests. However, training the GAN is notoriously difficult due to the issue of mode collapse, which refers to the lack of diversity among generated data. In this paper, we identify the reasons why the GAN suffers from this issue, and to address it, we propose a new formulation for the GAN based on randomized decision rules. In the new formulation, the discriminator converges to a fixed point while the generator converges to a distribution at the Nash equilibrium. We propose to train the GAN by an empirical Bayes-like method by treating the discriminator as a hyper-parameter of the posterior distribution of the generator. Specifically, we simulate generators from its posterior distribution conditioned on the discriminator using a stochastic gradient Markov chain Monte Carlo (MCMC) algorithm, and update the discriminator using stochastic gradient descent along with simulations of the generators. We establish convergence of the proposed method to the Nash equilibrium. Apart from image generation, we apply the proposed method to nonparametric clustering and nonparametric conditional independence tests. A portion of the numerical results is presented in the supplementary material.

生成对抗网络(GAN)作为一种训练生成模型的新型机器学习方法最近在文献中被介绍。它在非参数聚类和非参数条件独立检验等统计学中有广泛的应用。然而,由于模式崩溃的问题,训练GAN是出了名的困难,模式崩溃是指生成的数据之间缺乏多样性。在本文中,我们确定了GAN遭受此问题的原因,并提出了一种基于随机决策规则的GAN新公式。在新公式中,鉴别器收敛到一个不动点,而发生器收敛到纳什平衡点的一个分布。我们建议通过经验贝叶斯方法训练GAN,将鉴别器视为生成器后验分布的超参数。具体来说,我们使用随机梯度马尔可夫链蒙特卡罗(MCMC)算法从判别器条件下的后验分布模拟生成器,并使用随机梯度下降随着生成器的模拟更新鉴别器。并证明了该方法对纳什均衡的收敛性。除了图像生成之外,我们还将该方法应用于非参数聚类和非参数条件独立性检验。部分数值结果载于补充资料中。
{"title":"A New Paradigm for Generative Adversarial Networks based on Randomized Decision Rules.","authors":"Sehwan Kim, Qifan Song, Faming Liang","doi":"10.5705/ss.202022.0404","DOIUrl":"https://doi.org/10.5705/ss.202022.0404","url":null,"abstract":"<p><p>The Generative Adversarial Network (GAN) was recently introduced in the literature as a novel machine learning method for training generative models. It has many applications in statistics such as nonparametric clustering and nonparametric conditional independence tests. However, training the GAN is notoriously difficult due to the issue of mode collapse, which refers to the lack of diversity among generated data. In this paper, we identify the reasons why the GAN suffers from this issue, and to address it, we propose a new formulation for the GAN based on randomized decision rules. In the new formulation, the discriminator converges to a fixed point while the generator converges to a distribution at the Nash equilibrium. We propose to train the GAN by an empirical Bayes-like method by treating the discriminator as a hyper-parameter of the posterior distribution of the generator. Specifically, we simulate generators from its posterior distribution conditioned on the discriminator using a stochastic gradient Markov chain Monte Carlo (MCMC) algorithm, and update the discriminator using stochastic gradient descent along with simulations of the generators. We establish convergence of the proposed method to the Nash equilibrium. Apart from image generation, we apply the proposed method to nonparametric clustering and nonparametric conditional independence tests. A portion of the numerical results is presented in the supplementary material.</p>","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"35 2","pages":"897-918"},"PeriodicalIF":1.5,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12017776/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144028320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-response Regression for Block-missing Multi-modal Data without Imputation. 针对块缺失多模态数据的多响应回归,无需估算。
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-04-01 DOI: 10.5705/ss.202021.0170
Haodong Wang, Quefeng Li, Yufeng Liu

Multi-modal data are prevalent in many scientific fields. In this study, we consider the parameter estimation and variable selection for a multi-response regression using block-missing multi-modal data. Our method allows the dimensions of both the responses and the predictors to be large, and the responses to be incomplete and correlated, a common practical problem in high-dimensional settings. Our proposed method uses two steps to make a prediction from a multi-response linear regression model with block-missing multi-modal predictors. In the first step, without imputing missing data, we use all available data to estimate the covariance matrix of the predictors and the cross-covariance matrix between the predictors and the responses. In the second step, we use these matrices and a penalized method to simultaneously estimate the precision matrix of the response vector, given the predictors, and the sparse regression parameter matrix. Lastly, we demonstrate the effectiveness of the proposed method using theoretical studies, simulated examples, and an analysis of a multi-modal imaging data set from the Alzheimer's Disease Neuroimaging Initiative.

多模态数据在很多科学领域都很普遍。在本研究中,我们考虑了使用块缺失多模态数据进行多响应回归的参数估计和变量选择。我们的方法允许响应和预测因子的维度都很大,并且响应是不完整和相关的,这是高维环境中常见的实际问题。我们提出的方法采用两个步骤,对带有块缺失多模态预测因子的多响应线性回归模型进行预测。第一步,在不计算缺失数据的情况下,我们使用所有可用数据来估计预测因子的协方差矩阵以及预测因子与响应之间的交叉协方差矩阵。在第二步中,我们使用这些矩阵和一种惩罚性方法来同时估计响应向量的精度矩阵(给定预测因子)和稀疏回归参数矩阵。最后,我们通过理论研究、模拟示例以及对阿尔茨海默病神经成像计划多模态成像数据集的分析,证明了所提方法的有效性。
{"title":"Multi-response Regression for Block-missing Multi-modal Data without Imputation.","authors":"Haodong Wang, Quefeng Li, Yufeng Liu","doi":"10.5705/ss.202021.0170","DOIUrl":"10.5705/ss.202021.0170","url":null,"abstract":"<p><p>Multi-modal data are prevalent in many scientific fields. In this study, we consider the parameter estimation and variable selection for a multi-response regression using block-missing multi-modal data. Our method allows the dimensions of both the responses and the predictors to be large, and the responses to be incomplete and correlated, a common practical problem in high-dimensional settings. Our proposed method uses two steps to make a prediction from a multi-response linear regression model with block-missing multi-modal predictors. In the first step, without imputing missing data, we use all available data to estimate the covariance matrix of the predictors and the cross-covariance matrix between the predictors and the responses. In the second step, we use these matrices and a penalized method to simultaneously estimate the precision matrix of the response vector, given the predictors, and the sparse regression parameter matrix. Lastly, we demonstrate the effectiveness of the proposed method using theoretical studies, simulated examples, and an analysis of a multi-modal imaging data set from the Alzheimer's Disease Neuroimaging Initiative.</p>","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":"527-546"},"PeriodicalIF":1.4,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11035992/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70937715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact Analysis for Spatial Autoregressive Models: With Application to Air Pollution in China 空间自回归模型对中国大气污染的影响分析
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-01 DOI: 10.5705/ss.202021.0119
Hsuan-Yu Chang, Jihai Yu
: In this paper, we investigate impact analysis and its asymptotic inference for spatial autoregressive models. LeSage and Pace (2009) introduce impact analysis for spatial models and use Monte Carlo simulations to compute the dispersion. We propose to use the delta method, which enables us to obtain the dispersion in an explicit form. In addition, we provide the element-wise impact analysis. We first study the cross-sectional case, where various impacts are introduced to measure the interaction and feedback effects in a space dimension. We then study the spatial dynamic panel case with simultaneous spatial and dynamic feedback involved in the impacts. Monte Carlo results show that the proposed impact analysis has satisfactory finite sample properties. Finally, we apply impact analysis to investigate how meteorological factors and air pollutants affect PM 2 . 5 in Chinese cities.
研究空间自回归模型的影响分析及其渐近推断。我们建议使用delta方法,它使我们能够以显式形式获得色散。此外,我们还提供元素影响分析。我们首先研究了横截面案例,其中引入了各种影响来测量空间维度上的相互作用和反馈效应。然后,我们研究了空间动态面板的情况下,同时空间和动态反馈的影响。蒙特卡罗结果表明,所提出的冲击分析具有令人满意的有限样本性质。最后,运用影响分析方法探讨气象因子和大气污染物对pm2的影响。5个在中国城市。
{"title":"Impact Analysis for Spatial Autoregressive Models: With Application to Air Pollution in China","authors":"Hsuan-Yu Chang, Jihai Yu","doi":"10.5705/ss.202021.0119","DOIUrl":"https://doi.org/10.5705/ss.202021.0119","url":null,"abstract":": In this paper, we investigate impact analysis and its asymptotic inference for spatial autoregressive models. LeSage and Pace (2009) introduce impact analysis for spatial models and use Monte Carlo simulations to compute the dispersion. We propose to use the delta method, which enables us to obtain the dispersion in an explicit form. In addition, we provide the element-wise impact analysis. We first study the cross-sectional case, where various impacts are introduced to measure the interaction and feedback effects in a space dimension. We then study the spatial dynamic panel case with simultaneous spatial and dynamic feedback involved in the impacts. Monte Carlo results show that the proposed impact analysis has satisfactory finite sample properties. Finally, we apply impact analysis to investigate how meteorological factors and air pollutants affect PM 2 . 5 in Chinese cities.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"36 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70937305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonlinear dimension reduction for functional data with application to clustering 函数数据非线性降维及其在聚类中的应用
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-01 DOI: 10.5705/ss.202021.0393
Ruoxu Tan, Yiming Zang, G. Yin
Nonlinear dimension reduction for functional data with application to clustering
功能数据通常具有非线性结构,例如相位变化,因此线性降维技术可能无效。基于假设数据位于一个未知的带有噪声的流形上,研究了函数数据的非线性降维问题。我们将最近开发的用于高维数据的流形学习方法推广到我们的环境中,并在考虑噪声的情况下推导出渐近收敛结果。基于综合算例的结果往往比传统的功能等高线方法产生更精确的测地线距离估计。我们进一步开发了一种基于流形学习结果的聚类策略,并证明如果数据位于弯曲流形上,我们的方法优于其他方法。给出了两个实际数据示例来说明。1.中国统计:预印本doi:10.5705/ss.202021.0393
{"title":"Nonlinear dimension reduction for functional data with application to clustering","authors":"Ruoxu Tan, Yiming Zang, G. Yin","doi":"10.5705/ss.202021.0393","DOIUrl":"https://doi.org/10.5705/ss.202021.0393","url":null,"abstract":"Nonlinear dimension reduction for functional data with application to clustering","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70937980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unbiased Boosting Estimation for Censored Survival Data 删节生存数据的无偏增强估计
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-01 DOI: 10.5705/ss.202021.0050
Li‐Pang Chen, G. Yi
: Boosting methods have been broadly discussed for various settings, and most methods handle data with complete observations. Although some methods are available for survival data with censored responses, they tend to assume a specific model for the survival process, and most provide numerical implementation procedures without rigorous theoretical justifications. In this paper, we develop an unbiased boosting estimation method for censored survival data, without assuming an explicit model, and explore three strategies for adjusting the loss functions, while accommodating censoring effects. We implement the proposed method using a functional gradient descent algorithm, and rigorously establish our theoretical results, including the consistency and optimization convergence. Our numerical studies show that the proposed method exhibits satisfactory performance in finite-sample settings.
对于各种设置的增强方法已经进行了广泛的讨论,并且大多数方法处理具有完整观测值的数据。虽然有些方法可用于带有删减响应的生存数据,但它们倾向于假设生存过程的特定模型,并且大多数方法提供的数值实现程序没有严格的理论依据。在本文中,我们开发了一种无偏的增强估计方法,不假设一个显式模型,并探讨了三种策略来调整损失函数,同时适应审查的影响。我们使用泛函梯度下降算法实现了所提出的方法,并严格验证了我们的理论结果,包括一致性和优化收敛性。数值研究表明,该方法在有限样本条件下具有令人满意的性能。Grace Yi是通讯作者。电子邮件:gyi5@uwo.ca中国统计:预印本doi:10.5705/ss.202021.0050
{"title":"Unbiased Boosting Estimation for Censored Survival Data","authors":"Li‐Pang Chen, G. Yi","doi":"10.5705/ss.202021.0050","DOIUrl":"https://doi.org/10.5705/ss.202021.0050","url":null,"abstract":": Boosting methods have been broadly discussed for various settings, and most methods handle data with complete observations. Although some methods are available for survival data with censored responses, they tend to assume a specific model for the survival process, and most provide numerical implementation procedures without rigorous theoretical justifications. In this paper, we develop an unbiased boosting estimation method for censored survival data, without assuming an explicit model, and explore three strategies for adjusting the loss functions, while accommodating censoring effects. We implement the proposed method using a functional gradient descent algorithm, and rigorously establish our theoretical results, including the consistency and optimization convergence. Our numerical studies show that the proposed method exhibits satisfactory performance in finite-sample settings.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70936904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parsimonious Tensor Discriminant Analysis 简约张量判别分析
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-01 DOI: 10.5705/ss.202020.0496
Ning Wang, Wenjing Wang, Xin Zhang
: Discriminant analyses of multidimensional array data (i.e., tensors) are of substantial interest in numerous statistics and engineering research problems, such as signal processing, imaging, genetics, and brain–computer interfaces. In this study, we consider a multi-class discriminant analysis with a tensor-variate predictor and a categorical response. To overcome the high dimensionality and to exploit the tensor correlation structure, we propose the discriminant analysis with tensor envelope (DATE) model for simultaneous dimension reduction and classification. We extend the notion of tensor envelopes from regression to discriminant analysis and develop two complementary estimation procedures: DATE-L is a likelihood-based estimator that is shown to be asymptotically efficient when the sample size goes to infinity and the tensor dimension is fixed; DATE-D is a novel decomposition-based estimator suitable for high-dimensional problems. Interestingly, we show that DATE-D is still root-n consistent, even when the tensor dimensions on each model grow arbitrarily fast, but at a similar rate. We demonstrate the robustness and effi-ciency of our estimators using extensive simulations and real-data examples.
多维阵列数据(即张量)的判别分析在许多统计学和工程研究问题中具有重要意义,例如信号处理、成像、遗传学和脑机接口。在这项研究中,我们考虑了一个具有张量变量预测器和分类响应的多类判别分析。为了克服数据的高维性和利用张量关联结构,我们提出了基于张量包络(DATE)模型的判别分析方法,用于同时进行降维和分类。我们将张量包膜的概念从回归扩展到判别分析,并开发了两个互补的估计过程:DATE-L是一个基于似然的估计器,当样本量趋于无穷大且张量维固定时,它是渐近有效的;DATE-D是一种适用于高维问题的新的基于分解的估计器。有趣的是,我们证明DATE-D仍然是根n一致的,即使每个模型上的张量维以任意快的速度增长,但速度相似。我们通过大量的模拟和实际数据示例证明了我们的估计器的鲁棒性和效率。
{"title":"Parsimonious Tensor Discriminant Analysis","authors":"Ning Wang, Wenjing Wang, Xin Zhang","doi":"10.5705/ss.202020.0496","DOIUrl":"https://doi.org/10.5705/ss.202020.0496","url":null,"abstract":": Discriminant analyses of multidimensional array data (i.e., tensors) are of substantial interest in numerous statistics and engineering research problems, such as signal processing, imaging, genetics, and brain–computer interfaces. In this study, we consider a multi-class discriminant analysis with a tensor-variate predictor and a categorical response. To overcome the high dimensionality and to exploit the tensor correlation structure, we propose the discriminant analysis with tensor envelope (DATE) model for simultaneous dimension reduction and classification. We extend the notion of tensor envelopes from regression to discriminant analysis and develop two complementary estimation procedures: DATE-L is a likelihood-based estimator that is shown to be asymptotically efficient when the sample size goes to infinity and the tensor dimension is fixed; DATE-D is a novel decomposition-based estimator suitable for high-dimensional problems. Interestingly, we show that DATE-D is still root-n consistent, even when the tensor dimensions on each model grow arbitrarily fast, but at a similar rate. We demonstrate the robustness and effi-ciency of our estimators using extensive simulations and real-data examples.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70936940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Zero-imputation Approach in Recommendation Systems with Data Missing Heterogeneously 数据异构缺失推荐系统中的零归一化方法
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-01 DOI: 10.5705/ss.202021.0429
Jiashen Lu, Kehui Chen
{"title":"A Zero-imputation Approach in Recommendation Systems with Data Missing Heterogeneously","authors":"Jiashen Lu, Kehui Chen","doi":"10.5705/ss.202021.0429","DOIUrl":"https://doi.org/10.5705/ss.202021.0429","url":null,"abstract":"","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70938173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Kernel Regression Utilizing External Information as Constraints 利用外部信息作为约束的核回归
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2024-01-01 DOI: 10.5705/ss.202021.0446
Chi-Shian Dai, Jun Shao
{"title":"Kernel Regression Utilizing External Information as Constraints","authors":"Chi-Shian Dai, Jun Shao","doi":"10.5705/ss.202021.0446","DOIUrl":"https://doi.org/10.5705/ss.202021.0446","url":null,"abstract":"","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70938185","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Statistica Sinica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1