首页 > 最新文献

Statistica Sinica最新文献

英文 中文
Testing Hypotheses of Covariate-Adaptive Randomized Clinical Trials with Time-to-event Outcomes under the AFT Model AFT模型下具有事件时间结果的协变量自适应随机临床试验的假设检验
IF 1.4 3区 数学 Q2 Mathematics Pub Date : 2023-01-01 DOI: 10.5705/ss.202022.0011
Jianling Wang, Thuan Nguyen, Y. Luan, Jiming Jiang
Testing Hypotheses of Covariate-Adaptive Randomized Clinical Trials with Time-to-event Outcomes under the
基于事件时间结果的协变量自适应随机临床试验的假设检验
{"title":"Testing Hypotheses of Covariate-Adaptive Randomized Clinical Trials with Time-to-event Outcomes under the AFT Model","authors":"Jianling Wang, Thuan Nguyen, Y. Luan, Jiming Jiang","doi":"10.5705/ss.202022.0011","DOIUrl":"https://doi.org/10.5705/ss.202022.0011","url":null,"abstract":"Testing Hypotheses of Covariate-Adaptive Randomized Clinical Trials with Time-to-event Outcomes under the","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70937862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforcement Learning via Nonparametric Smoothing in a Continuous-Time Stochastic Setting with Noisy Data 基于非参数平滑的连续时间随机噪声环境下的强化学习
IF 1.4 3区 数学 Q2 Mathematics Pub Date : 2023-01-01 DOI: 10.5705/ss.202022.0407
Chenyang Jiang, Bowen Hu, Yazhen Wang, Shang Wu
{"title":"Reinforcement Learning via Nonparametric Smoothing in a Continuous-Time Stochastic Setting with Noisy Data","authors":"Chenyang Jiang, Bowen Hu, Yazhen Wang, Shang Wu","doi":"10.5705/ss.202022.0407","DOIUrl":"https://doi.org/10.5705/ss.202022.0407","url":null,"abstract":"","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70940184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hypotheses Testing of Functional Principal Components 功能主成分的假设检验
IF 1.4 3区 数学 Q2 Mathematics Pub Date : 2023-01-01 DOI: 10.5705/ss.202022.0309
Zening Song, Lijian Yang, Yuanyuan Zhang
: We propose a test for the hypothesis that the standardized functional principal components (FPCs) of functional data are equal to a given set of orthonormal bases (e.g., the Fourier basis). Using estimates of individual trajectories that satisfy certain approximation conditions, we construct a chi-square-type statistic, and show that it is oracally e(cid:14)cient under the null hypothesis, in the sense that its limiting distribution is the same as that of an infeasible statistic using all trajectories, known as the oracle." The null limiting distribution is an in(cid:12)nite Gaussian quadratic form, and we obtain a consistent estimator of its quantile. A test statistic based on the chi-squared-type statistic and the approximate quantile of the Gaussian quadratic form is shown to be both of the nominal asymptotic signi(cid:12)cance level and asymptotically correct. It is further shown that B-spline trajectory estimates meet the required approximation conditions. Simulation studies demonstrate the superior (cid:12)nite-sample performance of the proposed testing procedure. Using electroencephalogram (EEG) data, the proposed procedure con(cid:12)rms an interesting discovery that the centered EEG data are generated from a small
我们提出了一个假设的检验,即功能数据的标准化功能主成分(FPCs)等于给定的一组标准正交基(例如,傅里叶基)。使用满足某些近似条件的单个轨迹的估计,我们构造了一个凿方型统计量,并表明它在零假设下是口头有效的,因为它的极限分布与使用所有轨迹的不可行的统计量的极限分布相同,称为“神谕”。零极限分布是一个无限高斯二次型分布,得到了其分位数的一致估计。基于高斯二次型的近似分位数和轮廓型统计量的检验统计量既具有名义渐近显著性水平,又具有渐近正确性。进一步证明了b样条轨迹估计满足所需的近似条件。仿真研究表明,所提出的测试方法具有良好的有限样本性能。利用脑电图(EEG)数据,所提出的程序证实了一个有趣的发现,即集中的EEG数据是由一个小的Statistica Sinica生成的
{"title":"Hypotheses Testing of Functional Principal Components","authors":"Zening Song, Lijian Yang, Yuanyuan Zhang","doi":"10.5705/ss.202022.0309","DOIUrl":"https://doi.org/10.5705/ss.202022.0309","url":null,"abstract":": We propose a test for the hypothesis that the standardized functional principal components (FPCs) of functional data are equal to a given set of orthonormal bases (e.g., the Fourier basis). Using estimates of individual trajectories that satisfy certain approximation conditions, we construct a chi-square-type statistic, and show that it is oracally e(cid:14)cient under the null hypothesis, in the sense that its limiting distribution is the same as that of an infeasible statistic using all trajectories, known as the oracle.\" The null limiting distribution is an in(cid:12)nite Gaussian quadratic form, and we obtain a consistent estimator of its quantile. A test statistic based on the chi-squared-type statistic and the approximate quantile of the Gaussian quadratic form is shown to be both of the nominal asymptotic signi(cid:12)cance level and asymptotically correct. It is further shown that B-spline trajectory estimates meet the required approximation conditions. Simulation studies demonstrate the superior (cid:12)nite-sample performance of the proposed testing procedure. Using electroencephalogram (EEG) data, the proposed procedure con(cid:12)rms an interesting discovery that the centered EEG data are generated from a small","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70940204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Hypothesis Testing in High-Dimensional Instrumental Variables Regression With an Application to Genomics Data 高维工具变量回归中的假设检验及其在基因组学数据中的应用
3区 数学 Q2 Mathematics Pub Date : 2023-01-01 DOI: 10.5705/ss.202019.0408
Jiarui Lu, Hongzhe Li
Gene expression and phenotype association can be affected by potential unmeasured confounders from multiple sources, leading to biased estimates of the associations. Since genetic variants largely explain gene expression variations, they can be used as instruments in studying the association between gene expressions and phenotype in the framework of high dimensional instrumental variable (IV) regression. However, because the dimensions of both genetic variants and gene expressions are often larger than the sample size, statistical inferences such as hypothesis testing for such high dimensional IV models are not trivial and have not been investigated in literature. The problem is more challenging since the instrumental variables (e.g., genetic variants) have to be selected among a large set of genetic variants. This paper considers the problem of hypothesis testing for sparse IV regression models and presents methods for testing single regression coefficient and multiple testing of multiple coefficients, where the test statistic for each single coefficient is constructed based on an inverse regression. A multiple testing procedure is developed for selecting variables and is shown to control the false discovery rate. Simulations are conducted to evaluate the performance of our proposed methods. These methods are illustrated by an analysis of a yeast dataset in order to identify genes that are associated with growth in the presence of hydrogen peroxide.
基因表达和表型关联可能受到来自多个来源的潜在未测量混杂因素的影响,导致对关联的估计有偏倚。由于遗传变异在很大程度上解释了基因表达的变化,它们可以作为在高维工具变量(IV)回归框架下研究基因表达与表型之间关系的工具。然而,由于遗传变异和基因表达的维度往往大于样本量,因此对这种高维IV模型进行假设检验等统计推断并非微不足道,尚未在文献中进行研究。这个问题更具挑战性,因为工具变量(例如,遗传变异)必须在大量遗传变异中进行选择。本文考虑稀疏IV回归模型的假设检验问题,提出了单回归系数检验和多系数的多重检验方法,其中每个单系数的检验统计量是基于逆回归构造的。开发了一个多重测试程序来选择变量,并被证明可以控制错误发现率。通过仿真来评估我们提出的方法的性能。这些方法通过酵母数据集的分析来说明,以确定在过氧化氢存在下与生长相关的基因。
{"title":"Hypothesis Testing in High-Dimensional Instrumental Variables Regression With an Application to Genomics Data","authors":"Jiarui Lu, Hongzhe Li","doi":"10.5705/ss.202019.0408","DOIUrl":"https://doi.org/10.5705/ss.202019.0408","url":null,"abstract":"Gene expression and phenotype association can be affected by potential unmeasured confounders from multiple sources, leading to biased estimates of the associations. Since genetic variants largely explain gene expression variations, they can be used as instruments in studying the association between gene expressions and phenotype in the framework of high dimensional instrumental variable (IV) regression. However, because the dimensions of both genetic variants and gene expressions are often larger than the sample size, statistical inferences such as hypothesis testing for such high dimensional IV models are not trivial and have not been investigated in literature. The problem is more challenging since the instrumental variables (e.g., genetic variants) have to be selected among a large set of genetic variants. This paper considers the problem of hypothesis testing for sparse IV regression models and presents methods for testing single regression coefficient and multiple testing of multiple coefficients, where the test statistic for each single coefficient is constructed based on an inverse regression. A multiple testing procedure is developed for selecting variables and is shown to control the false discovery rate. Simulations are conducted to evaluate the performance of our proposed methods. These methods are illustrated by an analysis of a yeast dataset in order to identify genes that are associated with growth in the presence of hydrogen peroxide.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135783365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Two-Sample Tests for Relevant Differences in the Eigenfunctions of Covariance Operators 协方差算子特征函数相关差异的双样本检验
3区 数学 Q2 Mathematics Pub Date : 2023-01-01 DOI: 10.5705/ss.202020.0365
Alexander Aue, Holger Dette, Gregory Rice
This paper deals with two-sample tests for functional time series data, which have become widely available in conjunction with the advent of modern complex observation systems. Here, particular interest is in evaluating whether two sets of functional time series observations share the shape of their primary modes of variation as encoded by the eigenfunctions of the respective covariance operators. To this end, a novel testing approach is introduced that connects with, and extends, existing literature in two main ways. First, tests are set up in the relevant testing framework, where interest is not in testing an exact null hypothesis but rather in detecting deviations deemed sufficiently relevant, with relevance determined by the practitioner and perhaps guided by domain experts. Second, the proposed test statistics rely on a self-normalization principle that helps to avoid the notoriously difficult task of estimating the long-run covariance structure of the underlying functional time series. The main theoretical result of this paper is the derivation of the large-sample behavior of the proposed test statistics. Empirical evidence, indicating that the proposed procedures work well in finite samples and compare favorably with competing methods, is provided through a simulation study, and an application to annual temperature data.
本文讨论了随着现代复杂观测系统的出现而广泛应用的功能时间序列数据的双样本检验。在这里,特别感兴趣的是评估两组函数时间序列观测是否共享由各自协方差算子的特征函数编码的其主要变化模式的形状。为此,介绍了一种新的测试方法,该方法以两种主要方式连接并扩展了现有文献。首先,在相关的测试框架中设置测试,其中的兴趣不是测试精确的零假设,而是检测被认为足够相关的偏差,由从业者确定相关性,并可能由领域专家指导。其次,所提出的测试统计依赖于自归一化原则,这有助于避免估计潜在功能时间序列的长期协方差结构这一众所周知的困难任务。本文的主要理论结果是推导了所提出的检验统计量的大样本行为。通过模拟研究和对年温度数据的应用,提供了经验证据,表明所提出的方法在有限样本中效果良好,并且与竞争方法相比具有优势。
{"title":"Two-Sample Tests for Relevant Differences in the Eigenfunctions of Covariance Operators","authors":"Alexander Aue, Holger Dette, Gregory Rice","doi":"10.5705/ss.202020.0365","DOIUrl":"https://doi.org/10.5705/ss.202020.0365","url":null,"abstract":"This paper deals with two-sample tests for functional time series data, which have become widely available in conjunction with the advent of modern complex observation systems. Here, particular interest is in evaluating whether two sets of functional time series observations share the shape of their primary modes of variation as encoded by the eigenfunctions of the respective covariance operators. To this end, a novel testing approach is introduced that connects with, and extends, existing literature in two main ways. First, tests are set up in the relevant testing framework, where interest is not in testing an exact null hypothesis but rather in detecting deviations deemed sufficiently relevant, with relevance determined by the practitioner and perhaps guided by domain experts. Second, the proposed test statistics rely on a self-normalization principle that helps to avoid the notoriously difficult task of estimating the long-run covariance structure of the underlying functional time series. The main theoretical result of this paper is the derivation of the large-sample behavior of the proposed test statistics. Empirical evidence, indicating that the proposed procedures work well in finite samples and compare favorably with competing methods, is provided through a simulation study, and an application to annual temperature data.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135181369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Consistency of BIC Model Averaging BIC模型平均的一致性
3区 数学 Q2 Mathematics Pub Date : 2023-01-01 DOI: 10.5705/ss.202021.0145
Ze Chen, Jianqiang Zhang, Wangli Xu, Yuhong Yang
{"title":"Consistency of BIC Model Averaging","authors":"Ze Chen, Jianqiang Zhang, Wangli Xu, Yuhong Yang","doi":"10.5705/ss.202021.0145","DOIUrl":"https://doi.org/10.5705/ss.202021.0145","url":null,"abstract":"","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135182929","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Sparse Functional Principal Component Analysis in High Dimensions 高维稀疏泛函主成分分析
3区 数学 Q2 Mathematics Pub Date : 2023-01-01 DOI: 10.5705/ss.202020.0445
Xiaoyu Hu, Fang Yao
Functional principal component analysis (FPCA) is a fundamental tool and has attracted increasing attention in recent decades, while existing methods are restricted to data with a single or finite number of random functions (much smaller than the sample size $n$). In this work, we focus on high-dimensional functional processes where the number of random functions $p$ is comparable to, or even much larger than $n$. Such data are ubiquitous in various fields such as neuroimaging analysis, and cannot be properly modeled by existing methods. We propose a new algorithm, called sparse FPCA, which is able to model principal eigenfunctions effectively under sensible sparsity regimes. While sparsity assumptions are standard in multivariate statistics, they have not been investigated in the complex context where not only is $p$ large, but also each variable itself is an intrinsically infinite-dimensional process. The sparsity structure motivates a thresholding rule that is easy to compute without nonparametric smoothing by exploiting the relationship between univariate orthonormal basis expansions and multivariate Kahunen-Lo`eve (K-L) representations. We investigate the theoretical properties of the resulting estimators, and illustrate the performance with simulated and real data examples.
功能主成分分析(Functional principal component analysis, FPCA)是一种基础工具,近几十年来受到越来越多的关注,而现有的方法仅限于具有单个或有限数量的随机函数(远小于样本量)的数据。在这项工作中,我们专注于高维函数过程,其中随机函数的数量p与n相当,甚至远远大于n。这些数据在神经影像分析等各个领域都无处不在,无法用现有方法正确建模。我们提出了一种新的算法,称为稀疏FPCA,它能够在显稀疏性条件下有效地建模主特征函数。虽然稀疏性假设在多元统计中是标准的,但它们并没有在复杂的环境中进行研究,在这种环境中,不仅$p$大,而且每个变量本身本质上是一个无限维的过程。通过利用单变量正交基展开式和多元Kahunen-Lo ' eve (K-L)表示之间的关系,稀疏性结构激发了一种无需非参数平滑即可轻松计算的阈值规则。我们研究了所得到的估计器的理论性质,并用模拟和实际数据实例说明了其性能。
{"title":"Sparse Functional Principal Component Analysis in High Dimensions","authors":"Xiaoyu Hu, Fang Yao","doi":"10.5705/ss.202020.0445","DOIUrl":"https://doi.org/10.5705/ss.202020.0445","url":null,"abstract":"Functional principal component analysis (FPCA) is a fundamental tool and has attracted increasing attention in recent decades, while existing methods are restricted to data with a single or finite number of random functions (much smaller than the sample size $n$). In this work, we focus on high-dimensional functional processes where the number of random functions $p$ is comparable to, or even much larger than $n$. Such data are ubiquitous in various fields such as neuroimaging analysis, and cannot be properly modeled by existing methods. We propose a new algorithm, called sparse FPCA, which is able to model principal eigenfunctions effectively under sensible sparsity regimes. While sparsity assumptions are standard in multivariate statistics, they have not been investigated in the complex context where not only is $p$ large, but also each variable itself is an intrinsically infinite-dimensional process. The sparsity structure motivates a thresholding rule that is easy to compute without nonparametric smoothing by exploiting the relationship between univariate orthonormal basis expansions and multivariate Kahunen-Lo`eve (K-L) representations. We investigate the theoretical properties of the resulting estimators, and illustrate the performance with simulated and real data examples.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136092729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Compound Sequential Change-point Detection in Parallel Data Streams 并行数据流中的复合顺序变更点检测
3区 数学 Q2 Mathematics Pub Date : 2023-01-01 DOI: 10.5705/ss.202020.0508
Yunxiao Chen, Xiaoou Li
We consider sequential change-point detection in parallel data streams, where each stream has its own change point. Once a change is detected in a data stream, this stream is deactivated permanently. The goal is to maximize the normal operation of the pre-change streams, while controlling the proportion of post-change streams among the active streams at all time points. Taking a Bayesian formulation, we develop a compound decision framework for this problem. A procedure is proposed that is uniformly optimal among all sequential procedures which control the expected proportion of post-change streams at all time points. We also investigate the asymptotic behavior of the proposed method when the number of data streams grows large. Numerical examples are provided to illustrate the use and performance of the proposed method.
我们考虑并行数据流中的顺序更改点检测,其中每个流都有自己的更改点。一旦在数据流中检测到更改,该流将永久停用。目标是最大化更改前流的正常运行,同时控制更改后流在所有时间点活动流中的比例。采用贝叶斯公式,我们开发了一个复合决策框架来解决这个问题。提出了一个在所有顺序过程中统一最优的过程,该过程控制了所有时间点的变更后流的预期比例。我们还研究了该方法在数据流数量增大时的渐近性。数值算例说明了该方法的应用和性能。
{"title":"Compound Sequential Change-point Detection in Parallel Data Streams","authors":"Yunxiao Chen, Xiaoou Li","doi":"10.5705/ss.202020.0508","DOIUrl":"https://doi.org/10.5705/ss.202020.0508","url":null,"abstract":"We consider sequential change-point detection in parallel data streams, where each stream has its own change point. Once a change is detected in a data stream, this stream is deactivated permanently. The goal is to maximize the normal operation of the pre-change streams, while controlling the proportion of post-change streams among the active streams at all time points. Taking a Bayesian formulation, we develop a compound decision framework for this problem. A procedure is proposed that is uniformly optimal among all sequential procedures which control the expected proportion of post-change streams at all time points. We also investigate the asymptotic behavior of the proposed method when the number of data streams grows large. Numerical examples are provided to illustrate the use and performance of the proposed method.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136297313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Mean Tests For High-dimensional Time Series 高维时间序列的均值检验
IF 1.4 3区 数学 Q2 Mathematics Pub Date : 2023-01-01 DOI: 10.5705/ss.202022.0147
Shuyi Zhang, Songxi Chen, Yumou Qiu
: This paper considers testing for two-sample mean difference with high-dimensional temporally dependent data, which is later extended to the one-sample situation. To eliminate the bias caused by the temporal dependence among the time series observations, a band-excluded U-statistic (BEU) is proposed to estimate the squared Euclidean distance between the two means, which excludes cross-products of data vectors among temporally close time points. The asymptotic normality of the BEU statistic is derived under the high-dimensional setting with “spatial” (column-wise) and temporal dependence. An estimator built on the kernel smoothed cross-time covariances is developed to estimate the variance of the BEU-statistic, which facilitates a test procedure based on the standardized BEU-statistic. The proposed test is nonparametric and adaptive to a wide range of dependence and dimensionality, and has attractive power properties relative to a self-normalized test. Numerical simulation and a real data analysis on the return and volatility of S&P 500 stocks before and after the 2008 financial crisis are conducted to demonstrate the performance and utility of the proposed test.
本文考虑了高维时变数据的两样本均值差检验,并将其推广到单样本情况。为了消除时间序列观测值之间的时间相关性所造成的偏差,提出了一种排除频带的u统计量(BEU)来估计两个均值之间的欧几里得距离的平方,该方法排除了时间上接近的时间点之间数据向量的交叉积。在具有“空间”(列方向)和时间依赖性的高维设置下,导出了BEU统计量的渐近正态性。提出了一种基于核平滑跨时间协方差的估计器来估计beu统计量的方差,从而简化了基于标准化beu统计量的测试过程。所提出的测试是非参数的,可适应大范围的依赖和维度,并且相对于自归一化测试具有吸引力的功率特性。通过对2008年金融危机前后标准普尔500指数股票收益率和波动性的数值模拟和实际数据分析,证明了所提出的检验方法的有效性和实用性。
{"title":"Mean Tests For High-dimensional Time Series","authors":"Shuyi Zhang, Songxi Chen, Yumou Qiu","doi":"10.5705/ss.202022.0147","DOIUrl":"https://doi.org/10.5705/ss.202022.0147","url":null,"abstract":": This paper considers testing for two-sample mean difference with high-dimensional temporally dependent data, which is later extended to the one-sample situation. To eliminate the bias caused by the temporal dependence among the time series observations, a band-excluded U-statistic (BEU) is proposed to estimate the squared Euclidean distance between the two means, which excludes cross-products of data vectors among temporally close time points. The asymptotic normality of the BEU statistic is derived under the high-dimensional setting with “spatial” (column-wise) and temporal dependence. An estimator built on the kernel smoothed cross-time covariances is developed to estimate the variance of the BEU-statistic, which facilitates a test procedure based on the standardized BEU-statistic. The proposed test is nonparametric and adaptive to a wide range of dependence and dimensionality, and has attractive power properties relative to a self-normalized test. Numerical simulation and a real data analysis on the return and volatility of S&P 500 stocks before and after the 2008 financial crisis are conducted to demonstrate the performance and utility of the proposed test.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":null,"pages":null},"PeriodicalIF":1.4,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"70938742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Nonparametric Maximum Likelihood Estimation Under a Likelihood Ratio Order 似然比阶下的非参数最大似然估计
3区 数学 Q2 Mathematics Pub Date : 2023-01-01 DOI: 10.5705/ss.202020.0207
Ted Westling, Kevin J. Downes, Dylan S. Small
Comparison of two univariate distributions based on independent samples from them is a fundamental problem in statistics, with applications in a wide variety of scientific disciplines. In many situations, we might hypothesize that the two distributions are stochastically ordered, meaning intuitively that samples from one distribution tend to be larger than those from the other. One type of stochastic order that arises in economics, biomedicine, and elsewhere is the likelihood ratio order, also known as the density ratio order, in which the ratio of the density functions of the two distributions is monotone non-decreasing. In this article, we derive and study the nonparametric maximum likelihood estimator of the individual distributions and the ratio of their densities under the likelihood ratio order. Our work applies to discrete distributions, continuous distributions, and mixed continuous-discrete distributions. We demonstrate convergence in distribution of the estimator in certain cases, and we illustrate our results using numerical experiments and an analysis of a biomarker for predicting bacterial infection in children with systemic inflammatory response syndrome.
基于独立样本的两个单变量分布的比较是统计学中的一个基本问题,在各种科学学科中都有广泛的应用。在许多情况下,我们可能会假设这两个分布是随机有序的,这意味着直觉上,一个分布的样本往往比另一个分布的样本大。在经济学、生物医学和其他领域出现的一种随机顺序是似然比顺序,也称为密度比顺序,其中两个分布的密度函数的比率是单调的,不减小的。本文导出并研究了似然比阶下单个分布及其密度之比的非参数极大似然估计量。我们的工作适用于离散分布、连续分布和连续-离散混合分布。我们在某些情况下证明了估计量分布的收敛性,并且我们使用数值实验和对预测全身性炎症反应综合征儿童细菌感染的生物标志物的分析来说明我们的结果。
{"title":"Nonparametric Maximum Likelihood Estimation Under a Likelihood Ratio Order","authors":"Ted Westling, Kevin J. Downes, Dylan S. Small","doi":"10.5705/ss.202020.0207","DOIUrl":"https://doi.org/10.5705/ss.202020.0207","url":null,"abstract":"Comparison of two univariate distributions based on independent samples from them is a fundamental problem in statistics, with applications in a wide variety of scientific disciplines. In many situations, we might hypothesize that the two distributions are stochastically ordered, meaning intuitively that samples from one distribution tend to be larger than those from the other. One type of stochastic order that arises in economics, biomedicine, and elsewhere is the likelihood ratio order, also known as the density ratio order, in which the ratio of the density functions of the two distributions is monotone non-decreasing. In this article, we derive and study the nonparametric maximum likelihood estimator of the individual distributions and the ratio of their densities under the likelihood ratio order. Our work applies to discrete distributions, continuous distributions, and mixed continuous-discrete distributions. We demonstrate convergence in distribution of the estimator in certain cases, and we illustrate our results using numerical experiments and an analysis of a biomarker for predicting bacterial infection in children with systemic inflammatory response syndrome.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136008744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Statistica Sinica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1