首页 > 最新文献

Statistica Sinica最新文献

英文 中文
Semiparametric Estimation of Non-ignorable Missingness With Refreshment Sample 点心样本不可忽略缺失的半参数估计
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-01 DOI: 10.5705/ss.202022.0214
Jianfei Zheng, Jing Wang, L. Xue, A. Qu
{"title":"Semiparametric Estimation of Non-ignorable Missingness With Refreshment Sample","authors":"Jianfei Zheng, Jing Wang, L. Xue, A. Qu","doi":"10.5705/ss.202022.0214","DOIUrl":"https://doi.org/10.5705/ss.202022.0214","url":null,"abstract":"","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70939023","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A More Efficient Isomorphism Check for Two-Level Nonregular Designs 两级不规则设计的一种更有效的同构检验
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-01 DOI: 10.5705/ss.202022.0200
Chunyan Wang, Robert W. Mee
: In this paper, we propose some new necessary and sufficient conditions for identifying isomorphism in two-level fractional factorial designs, using a parallel flats structure. A new algorithm for checking isomorphism is provided accordingly. The proposed algorithm is simple and general, and can be used for either regular or nonregular designs. By taking advantage of the parallel flats structure when it exists, the method is much faster than current methods for assessing the isomorphism of nonregular two-level designs. Examples are given to illustrate the results. An efficient implementation of the proposed algorithm in Matlab can be found in the online Supplementary Material.
本文利用平行平面结构,给出了判别二水平分数阶乘设计同构的几个新的充要条件。据此提出了一种新的同构检验算法。该算法简单、通用,可用于规则或非规则设计。该方法利用平行平面结构存在时的优势,比现有的非规则两层设计的同构性评估方法快得多。最后给出了算例来说明结果。在在线补充材料中可以找到在Matlab中有效实现所提出算法的方法。
{"title":"A More Efficient Isomorphism Check for Two-Level Nonregular Designs","authors":"Chunyan Wang, Robert W. Mee","doi":"10.5705/ss.202022.0200","DOIUrl":"https://doi.org/10.5705/ss.202022.0200","url":null,"abstract":": In this paper, we propose some new necessary and sufficient conditions for identifying isomorphism in two-level fractional factorial designs, using a parallel flats structure. A new algorithm for checking isomorphism is provided accordingly. The proposed algorithm is simple and general, and can be used for either regular or nonregular designs. By taking advantage of the parallel flats structure when it exists, the method is much faster than current methods for assessing the isomorphism of nonregular two-level designs. Examples are given to illustrate the results. An efficient implementation of the proposed algorithm in Matlab can be found in the online Supplementary Material.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70939010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse and Low-Rank Matrix Quantile Estimation With Application to Quadratic Regression 稀疏低秩矩阵分位数估计及其在二次回归中的应用
3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-01 DOI: 10.5705/ss.202021.0140
Wenqi Lu, Zhongyi Zhu, Heng Lian
{"title":"Sparse and Low-Rank Matrix Quantile Estimation With Application to Quadratic Regression","authors":"Wenqi Lu, Zhongyi Zhu, Heng Lian","doi":"10.5705/ss.202021.0140","DOIUrl":"https://doi.org/10.5705/ss.202021.0140","url":null,"abstract":"","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135182933","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Testing Hypotheses of Covariate-Adaptive Randomized Clinical Trials with Time-to-event Outcomes under the AFT Model AFT模型下具有事件时间结果的协变量自适应随机临床试验的假设检验
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-01 DOI: 10.5705/ss.202022.0011
Jianling Wang, Thuan Nguyen, Y. Luan, Jiming Jiang
Testing Hypotheses of Covariate-Adaptive Randomized Clinical Trials with Time-to-event Outcomes under the
基于事件时间结果的协变量自适应随机临床试验的假设检验
{"title":"Testing Hypotheses of Covariate-Adaptive Randomized Clinical Trials with Time-to-event Outcomes under the AFT Model","authors":"Jianling Wang, Thuan Nguyen, Y. Luan, Jiming Jiang","doi":"10.5705/ss.202022.0011","DOIUrl":"https://doi.org/10.5705/ss.202022.0011","url":null,"abstract":"Testing Hypotheses of Covariate-Adaptive Randomized Clinical Trials with Time-to-event Outcomes under the","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70937862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforcement Learning via Nonparametric Smoothing in a Continuous-Time Stochastic Setting with Noisy Data 基于非参数平滑的连续时间随机噪声环境下的强化学习
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-01 DOI: 10.5705/ss.202022.0407
Chenyang Jiang, Bowen Hu, Yazhen Wang, Shang Wu
{"title":"Reinforcement Learning via Nonparametric Smoothing in a Continuous-Time Stochastic Setting with Noisy Data","authors":"Chenyang Jiang, Bowen Hu, Yazhen Wang, Shang Wu","doi":"10.5705/ss.202022.0407","DOIUrl":"https://doi.org/10.5705/ss.202022.0407","url":null,"abstract":"","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70940184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hypotheses Testing of Functional Principal Components 功能主成分的假设检验
IF 1.4 3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-01 DOI: 10.5705/ss.202022.0309
Zening Song, Lijian Yang, Yuanyuan Zhang
: We propose a test for the hypothesis that the standardized functional principal components (FPCs) of functional data are equal to a given set of orthonormal bases (e.g., the Fourier basis). Using estimates of individual trajectories that satisfy certain approximation conditions, we construct a chi-square-type statistic, and show that it is oracally e(cid:14)cient under the null hypothesis, in the sense that its limiting distribution is the same as that of an infeasible statistic using all trajectories, known as the oracle." The null limiting distribution is an in(cid:12)nite Gaussian quadratic form, and we obtain a consistent estimator of its quantile. A test statistic based on the chi-squared-type statistic and the approximate quantile of the Gaussian quadratic form is shown to be both of the nominal asymptotic signi(cid:12)cance level and asymptotically correct. It is further shown that B-spline trajectory estimates meet the required approximation conditions. Simulation studies demonstrate the superior (cid:12)nite-sample performance of the proposed testing procedure. Using electroencephalogram (EEG) data, the proposed procedure con(cid:12)rms an interesting discovery that the centered EEG data are generated from a small
我们提出了一个假设的检验,即功能数据的标准化功能主成分(FPCs)等于给定的一组标准正交基(例如,傅里叶基)。使用满足某些近似条件的单个轨迹的估计,我们构造了一个凿方型统计量,并表明它在零假设下是口头有效的,因为它的极限分布与使用所有轨迹的不可行的统计量的极限分布相同,称为“神谕”。零极限分布是一个无限高斯二次型分布,得到了其分位数的一致估计。基于高斯二次型的近似分位数和轮廓型统计量的检验统计量既具有名义渐近显著性水平,又具有渐近正确性。进一步证明了b样条轨迹估计满足所需的近似条件。仿真研究表明,所提出的测试方法具有良好的有限样本性能。利用脑电图(EEG)数据,所提出的程序证实了一个有趣的发现,即集中的EEG数据是由一个小的Statistica Sinica生成的
{"title":"Hypotheses Testing of Functional Principal Components","authors":"Zening Song, Lijian Yang, Yuanyuan Zhang","doi":"10.5705/ss.202022.0309","DOIUrl":"https://doi.org/10.5705/ss.202022.0309","url":null,"abstract":": We propose a test for the hypothesis that the standardized functional principal components (FPCs) of functional data are equal to a given set of orthonormal bases (e.g., the Fourier basis). Using estimates of individual trajectories that satisfy certain approximation conditions, we construct a chi-square-type statistic, and show that it is oracally e(cid:14)cient under the null hypothesis, in the sense that its limiting distribution is the same as that of an infeasible statistic using all trajectories, known as the oracle.\" The null limiting distribution is an in(cid:12)nite Gaussian quadratic form, and we obtain a consistent estimator of its quantile. A test statistic based on the chi-squared-type statistic and the approximate quantile of the Gaussian quadratic form is shown to be both of the nominal asymptotic signi(cid:12)cance level and asymptotically correct. It is further shown that B-spline trajectory estimates meet the required approximation conditions. Simulation studies demonstrate the superior (cid:12)nite-sample performance of the proposed testing procedure. Using electroencephalogram (EEG) data, the proposed procedure con(cid:12)rms an interesting discovery that the centered EEG data are generated from a small","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70940204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Two-Sample Tests for Relevant Differences in the Eigenfunctions of Covariance Operators 协方差算子特征函数相关差异的双样本检验
3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-01 DOI: 10.5705/ss.202020.0365
Alexander Aue, Holger Dette, Gregory Rice
This paper deals with two-sample tests for functional time series data, which have become widely available in conjunction with the advent of modern complex observation systems. Here, particular interest is in evaluating whether two sets of functional time series observations share the shape of their primary modes of variation as encoded by the eigenfunctions of the respective covariance operators. To this end, a novel testing approach is introduced that connects with, and extends, existing literature in two main ways. First, tests are set up in the relevant testing framework, where interest is not in testing an exact null hypothesis but rather in detecting deviations deemed sufficiently relevant, with relevance determined by the practitioner and perhaps guided by domain experts. Second, the proposed test statistics rely on a self-normalization principle that helps to avoid the notoriously difficult task of estimating the long-run covariance structure of the underlying functional time series. The main theoretical result of this paper is the derivation of the large-sample behavior of the proposed test statistics. Empirical evidence, indicating that the proposed procedures work well in finite samples and compare favorably with competing methods, is provided through a simulation study, and an application to annual temperature data.
本文讨论了随着现代复杂观测系统的出现而广泛应用的功能时间序列数据的双样本检验。在这里,特别感兴趣的是评估两组函数时间序列观测是否共享由各自协方差算子的特征函数编码的其主要变化模式的形状。为此,介绍了一种新的测试方法,该方法以两种主要方式连接并扩展了现有文献。首先,在相关的测试框架中设置测试,其中的兴趣不是测试精确的零假设,而是检测被认为足够相关的偏差,由从业者确定相关性,并可能由领域专家指导。其次,所提出的测试统计依赖于自归一化原则,这有助于避免估计潜在功能时间序列的长期协方差结构这一众所周知的困难任务。本文的主要理论结果是推导了所提出的检验统计量的大样本行为。通过模拟研究和对年温度数据的应用,提供了经验证据,表明所提出的方法在有限样本中效果良好,并且与竞争方法相比具有优势。
{"title":"Two-Sample Tests for Relevant Differences in the Eigenfunctions of Covariance Operators","authors":"Alexander Aue, Holger Dette, Gregory Rice","doi":"10.5705/ss.202020.0365","DOIUrl":"https://doi.org/10.5705/ss.202020.0365","url":null,"abstract":"This paper deals with two-sample tests for functional time series data, which have become widely available in conjunction with the advent of modern complex observation systems. Here, particular interest is in evaluating whether two sets of functional time series observations share the shape of their primary modes of variation as encoded by the eigenfunctions of the respective covariance operators. To this end, a novel testing approach is introduced that connects with, and extends, existing literature in two main ways. First, tests are set up in the relevant testing framework, where interest is not in testing an exact null hypothesis but rather in detecting deviations deemed sufficiently relevant, with relevance determined by the practitioner and perhaps guided by domain experts. Second, the proposed test statistics rely on a self-normalization principle that helps to avoid the notoriously difficult task of estimating the long-run covariance structure of the underlying functional time series. The main theoretical result of this paper is the derivation of the large-sample behavior of the proposed test statistics. Empirical evidence, indicating that the proposed procedures work well in finite samples and compare favorably with competing methods, is provided through a simulation study, and an application to annual temperature data.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"111 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135181369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Consistency of BIC Model Averaging BIC模型平均的一致性
3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-01 DOI: 10.5705/ss.202021.0145
Ze Chen, Jianqiang Zhang, Wangli Xu, Yuhong Yang
{"title":"Consistency of BIC Model Averaging","authors":"Ze Chen, Jianqiang Zhang, Wangli Xu, Yuhong Yang","doi":"10.5705/ss.202021.0145","DOIUrl":"https://doi.org/10.5705/ss.202021.0145","url":null,"abstract":"","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"168-169 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135182929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Hypothesis Testing in High-Dimensional Instrumental Variables Regression With an Application to Genomics Data 高维工具变量回归中的假设检验及其在基因组学数据中的应用
3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-01 DOI: 10.5705/ss.202019.0408
Jiarui Lu, Hongzhe Li
Gene expression and phenotype association can be affected by potential unmeasured confounders from multiple sources, leading to biased estimates of the associations. Since genetic variants largely explain gene expression variations, they can be used as instruments in studying the association between gene expressions and phenotype in the framework of high dimensional instrumental variable (IV) regression. However, because the dimensions of both genetic variants and gene expressions are often larger than the sample size, statistical inferences such as hypothesis testing for such high dimensional IV models are not trivial and have not been investigated in literature. The problem is more challenging since the instrumental variables (e.g., genetic variants) have to be selected among a large set of genetic variants. This paper considers the problem of hypothesis testing for sparse IV regression models and presents methods for testing single regression coefficient and multiple testing of multiple coefficients, where the test statistic for each single coefficient is constructed based on an inverse regression. A multiple testing procedure is developed for selecting variables and is shown to control the false discovery rate. Simulations are conducted to evaluate the performance of our proposed methods. These methods are illustrated by an analysis of a yeast dataset in order to identify genes that are associated with growth in the presence of hydrogen peroxide.
基因表达和表型关联可能受到来自多个来源的潜在未测量混杂因素的影响,导致对关联的估计有偏倚。由于遗传变异在很大程度上解释了基因表达的变化,它们可以作为在高维工具变量(IV)回归框架下研究基因表达与表型之间关系的工具。然而,由于遗传变异和基因表达的维度往往大于样本量,因此对这种高维IV模型进行假设检验等统计推断并非微不足道,尚未在文献中进行研究。这个问题更具挑战性,因为工具变量(例如,遗传变异)必须在大量遗传变异中进行选择。本文考虑稀疏IV回归模型的假设检验问题,提出了单回归系数检验和多系数的多重检验方法,其中每个单系数的检验统计量是基于逆回归构造的。开发了一个多重测试程序来选择变量,并被证明可以控制错误发现率。通过仿真来评估我们提出的方法的性能。这些方法通过酵母数据集的分析来说明,以确定在过氧化氢存在下与生长相关的基因。
{"title":"Hypothesis Testing in High-Dimensional Instrumental Variables Regression With an Application to Genomics Data","authors":"Jiarui Lu, Hongzhe Li","doi":"10.5705/ss.202019.0408","DOIUrl":"https://doi.org/10.5705/ss.202019.0408","url":null,"abstract":"Gene expression and phenotype association can be affected by potential unmeasured confounders from multiple sources, leading to biased estimates of the associations. Since genetic variants largely explain gene expression variations, they can be used as instruments in studying the association between gene expressions and phenotype in the framework of high dimensional instrumental variable (IV) regression. However, because the dimensions of both genetic variants and gene expressions are often larger than the sample size, statistical inferences such as hypothesis testing for such high dimensional IV models are not trivial and have not been investigated in literature. The problem is more challenging since the instrumental variables (e.g., genetic variants) have to be selected among a large set of genetic variants. This paper considers the problem of hypothesis testing for sparse IV regression models and presents methods for testing single regression coefficient and multiple testing of multiple coefficients, where the test statistic for each single coefficient is constructed based on an inverse regression. A multiple testing procedure is developed for selecting variables and is shown to control the false discovery rate. Simulations are conducted to evaluate the performance of our proposed methods. These methods are illustrated by an analysis of a yeast dataset in order to identify genes that are associated with growth in the presence of hydrogen peroxide.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"53 60 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135783365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse Functional Principal Component Analysis in High Dimensions 高维稀疏泛函主成分分析
3区 数学 Q2 STATISTICS & PROBABILITY Pub Date : 2023-01-01 DOI: 10.5705/ss.202020.0445
Xiaoyu Hu, Fang Yao
Functional principal component analysis (FPCA) is a fundamental tool and has attracted increasing attention in recent decades, while existing methods are restricted to data with a single or finite number of random functions (much smaller than the sample size $n$). In this work, we focus on high-dimensional functional processes where the number of random functions $p$ is comparable to, or even much larger than $n$. Such data are ubiquitous in various fields such as neuroimaging analysis, and cannot be properly modeled by existing methods. We propose a new algorithm, called sparse FPCA, which is able to model principal eigenfunctions effectively under sensible sparsity regimes. While sparsity assumptions are standard in multivariate statistics, they have not been investigated in the complex context where not only is $p$ large, but also each variable itself is an intrinsically infinite-dimensional process. The sparsity structure motivates a thresholding rule that is easy to compute without nonparametric smoothing by exploiting the relationship between univariate orthonormal basis expansions and multivariate Kahunen-Lo`eve (K-L) representations. We investigate the theoretical properties of the resulting estimators, and illustrate the performance with simulated and real data examples.
功能主成分分析(Functional principal component analysis, FPCA)是一种基础工具,近几十年来受到越来越多的关注,而现有的方法仅限于具有单个或有限数量的随机函数(远小于样本量)的数据。在这项工作中,我们专注于高维函数过程,其中随机函数的数量p与n相当,甚至远远大于n。这些数据在神经影像分析等各个领域都无处不在,无法用现有方法正确建模。我们提出了一种新的算法,称为稀疏FPCA,它能够在显稀疏性条件下有效地建模主特征函数。虽然稀疏性假设在多元统计中是标准的,但它们并没有在复杂的环境中进行研究,在这种环境中,不仅$p$大,而且每个变量本身本质上是一个无限维的过程。通过利用单变量正交基展开式和多元Kahunen-Lo ' eve (K-L)表示之间的关系,稀疏性结构激发了一种无需非参数平滑即可轻松计算的阈值规则。我们研究了所得到的估计器的理论性质,并用模拟和实际数据实例说明了其性能。
{"title":"Sparse Functional Principal Component Analysis in High Dimensions","authors":"Xiaoyu Hu, Fang Yao","doi":"10.5705/ss.202020.0445","DOIUrl":"https://doi.org/10.5705/ss.202020.0445","url":null,"abstract":"Functional principal component analysis (FPCA) is a fundamental tool and has attracted increasing attention in recent decades, while existing methods are restricted to data with a single or finite number of random functions (much smaller than the sample size $n$). In this work, we focus on high-dimensional functional processes where the number of random functions $p$ is comparable to, or even much larger than $n$. Such data are ubiquitous in various fields such as neuroimaging analysis, and cannot be properly modeled by existing methods. We propose a new algorithm, called sparse FPCA, which is able to model principal eigenfunctions effectively under sensible sparsity regimes. While sparsity assumptions are standard in multivariate statistics, they have not been investigated in the complex context where not only is $p$ large, but also each variable itself is an intrinsically infinite-dimensional process. The sparsity structure motivates a thresholding rule that is easy to compute without nonparametric smoothing by exploiting the relationship between univariate orthonormal basis expansions and multivariate Kahunen-Lo`eve (K-L) representations. We investigate the theoretical properties of the resulting estimators, and illustrate the performance with simulated and real data examples.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136092729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Statistica Sinica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1