首页 > 最新文献

Electronic Journal of Statistics最新文献

英文 中文
On the choice of the splitting ratio for the split likelihood ratio test 关于分裂似然比检验中分裂比的选择
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs2099
David Strieder, M. Drton
The recently introduced framework of universal inference provides a new approach to constructing hypothesis tests and confidence regions that are valid in finite samples and do not rely on any specific regularity assumptions on the underlying statistical model. At the core of the methodology is a split likelihood ratio statistic, which is formed under data splitting and compared to a cleverly selected universal critical value. As this critical value can be very conservative, it is interesting to mitigate the potential loss of power by careful choice of the ratio according to which data are split. Motivated by this problem, we study the split likelihood ratio test under local alternatives and introduce the resulting class of noncentral split chi-square distributions. We investigate the properties of this new class of distributions and use it to numerically examine and propose an optimal choice of the data splitting ratio for tests of composite hypotheses of different dimensions.
最近引入的通用推理框架提供了一种新的方法来构建在有限样本中有效的假设检验和置信区,并且不依赖于对基础统计模型的任何特定规则性假设。该方法的核心是分裂似然比统计,它是在数据分裂下形成的,并与巧妙选择的通用临界值进行比较。由于这个临界值可能非常保守,因此通过仔细选择数据分割的比率来减轻潜在的功率损失是很有趣的。受此问题的启发,我们研究了局部备选方案下的分裂似然比检验,并引入了由此产生的一类非中心分裂卡方分布。我们研究了这类新分布的性质,并用它对不同维度的复合假设进行了数值检验,提出了数据分割率的最优选择。
{"title":"On the choice of the splitting ratio for the split likelihood ratio test","authors":"David Strieder, M. Drton","doi":"10.1214/22-ejs2099","DOIUrl":"https://doi.org/10.1214/22-ejs2099","url":null,"abstract":"The recently introduced framework of universal inference provides a new approach to constructing hypothesis tests and confidence regions that are valid in finite samples and do not rely on any specific regularity assumptions on the underlying statistical model. At the core of the methodology is a split likelihood ratio statistic, which is formed under data splitting and compared to a cleverly selected universal critical value. As this critical value can be very conservative, it is interesting to mitigate the potential loss of power by careful choice of the ratio according to which data are split. Motivated by this problem, we study the split likelihood ratio test under local alternatives and introduce the resulting class of noncentral split chi-square distributions. We investigate the properties of this new class of distributions and use it to numerically examine and propose an optimal choice of the data splitting ratio for tests of composite hypotheses of different dimensions.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48549640","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Generalized maximum likelihood estimation of the mean of parameters of mixtures. With applications to sampling and to observational studies 混合参数均值的广义极大似然估计。应用于抽样和观察研究
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs2082
E. Greenshtein, Ya'acov Ritov
{"title":"Generalized maximum likelihood estimation of the mean of parameters of mixtures. With applications to sampling and to observational studies","authors":"E. Greenshtein, Ya'acov Ritov","doi":"10.1214/22-ejs2082","DOIUrl":"https://doi.org/10.1214/22-ejs2082","url":null,"abstract":"","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49008194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Multi-sample comparison using spatial signs for infinite dimensional data 使用空间符号对无限维数据进行多样本比较
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs2054
Joydeep Chowdhury, P. Chaudhuri
We consider an analysis of variance type problem, where the sample observations are random elements in an infinite dimensional space. This scenario covers the case, where the observations are random functions. For such a problem, we propose a test based on spatial signs. We develop an asymptotic implementation as well as a bootstrap implementation and a permutation implementation of this test and investigate their size and power properties. We compare the performance of our test with that of several mean based tests of analysis of variance for functional data studied in the literature. Interestingly, our test not only outperforms the mean based tests in several non-Gaussian models with heavy tails or skewed distributions, but in some Gaussian models also. Further, we also compare the performance of our test with the mean based tests in several models involving contaminated probability distributions. Finally, we demonstrate the performance of these tests in three real datasets: a Canadian weather dataset, a spectrometric dataset on chemical analysis of meat samples and a dataset on orthotic measurements on volunteers.
我们考虑方差分析型问题,其中样本观测是无限维空间中的随机元素。该场景涵盖了观测值为随机函数的情况。对于这样一个问题,我们提出了一个基于空间符号的测试。我们开发了该测试的渐近实现、bootstrap实现和置换实现,并研究了它们的大小和幂性质。我们将我们的测试与文献中研究的函数数据的方差分析的几种基于均值的测试的性能进行了比较。有趣的是,我们的测试不仅在几个具有重尾或偏斜分布的非高斯模型中优于基于平均值的测试,而且在一些高斯模型中也优于基于均值的测试。此外,我们还比较了我们的测试与几个涉及污染概率分布的模型中基于均值的测试的性能。最后,我们在三个真实数据集中展示了这些测试的性能:一个是加拿大天气数据集,一个是肉类样本化学分析的光谱数据集,另一个是志愿者的正交测量数据集。
{"title":"Multi-sample comparison using spatial signs for infinite dimensional data","authors":"Joydeep Chowdhury, P. Chaudhuri","doi":"10.1214/22-ejs2054","DOIUrl":"https://doi.org/10.1214/22-ejs2054","url":null,"abstract":"We consider an analysis of variance type problem, where the sample observations are random elements in an infinite dimensional space. This scenario covers the case, where the observations are random functions. For such a problem, we propose a test based on spatial signs. We develop an asymptotic implementation as well as a bootstrap implementation and a permutation implementation of this test and investigate their size and power properties. We compare the performance of our test with that of several mean based tests of analysis of variance for functional data studied in the literature. Interestingly, our test not only outperforms the mean based tests in several non-Gaussian models with heavy tails or skewed distributions, but in some Gaussian models also. Further, we also compare the performance of our test with the mean based tests in several models involving contaminated probability distributions. Finally, we demonstrate the performance of these tests in three real datasets: a Canadian weather dataset, a spectrometric dataset on chemical analysis of meat samples and a dataset on orthotic measurements on volunteers.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47773713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Truncated sum-of-squares estimation of fractional time series models with generalized power law trend 具有广义幂律趋势的分数阶时间序列模型的截断平方和估计
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs2009
J. Hualde, M. Nielsen
Abstract: We consider truncated (or conditional) sum-of-squares estimation of a parametric fractional time series model with an additive deterministic structure. The latter consists of both a drift term and a generalized power law trend. The memory parameter of the stochastic component and the power parameter of the deterministic trend component are both considered unknown real numbers to be estimated and belonging to arbitrarily large compact sets. Thus, our model captures different forms of nonstationarity and noninvertibility as well as a very flexible deterministic specification. As in related settings, the proof of consistency (which is a prerequisite for proving asymptotic normality) is challenging due to non-uniform convergence of the objective function over a large admissible parameter space and due to the competition between stochastic and deterministic components. As expected, parameter estimates related to the deterministic component are shown to be consistent and asymptotically normal only for parts of the parameter space depending on the relative strength of the stochastic and deterministic components. In contrast, we establish consistency and asymptotic normality of parameter estimates related to the stochastic component for the entire parameter space. Furthermore, the asymptotic distribution of the latter estimates is unaffected by the presence of the deterministic component, even when this is not consistently estimable. We also include Monte Carlo simulations to illustrate our results.
摘要:研究具有加性确定性结构的参数分数阶时间序列模型的截断(或条件)平方和估计。后者包括漂移项和广义幂律趋势。随机分量的记忆参数和确定性趋势分量的功率参数都被认为是可估计的未知实数,它们属于任意大的紧集。因此,我们的模型捕获了不同形式的非平稳性和不可逆性,以及非常灵活的确定性规范。在相关设置中,一致性的证明(这是证明渐近正态性的先决条件)是具有挑战性的,因为目标函数在一个大的可接受参数空间上的非一致收敛,并且由于随机和确定性成分之间的竞争。正如预期的那样,与确定性分量相关的参数估计仅对部分参数空间显示为一致和渐近正态,这取决于随机分量和确定性分量的相对强度。相反,我们建立了与整个参数空间的随机分量相关的参数估计的一致性和渐近正态性。此外,后者估计的渐近分布不受确定性成分存在的影响,即使这不是一致可估计的。我们还包括蒙特卡罗模拟来说明我们的结果。
{"title":"Truncated sum-of-squares estimation of fractional time series models with generalized power law trend","authors":"J. Hualde, M. Nielsen","doi":"10.1214/22-ejs2009","DOIUrl":"https://doi.org/10.1214/22-ejs2009","url":null,"abstract":"Abstract: We consider truncated (or conditional) sum-of-squares estimation of a parametric fractional time series model with an additive deterministic structure. The latter consists of both a drift term and a generalized power law trend. The memory parameter of the stochastic component and the power parameter of the deterministic trend component are both considered unknown real numbers to be estimated and belonging to arbitrarily large compact sets. Thus, our model captures different forms of nonstationarity and noninvertibility as well as a very flexible deterministic specification. As in related settings, the proof of consistency (which is a prerequisite for proving asymptotic normality) is challenging due to non-uniform convergence of the objective function over a large admissible parameter space and due to the competition between stochastic and deterministic components. As expected, parameter estimates related to the deterministic component are shown to be consistent and asymptotically normal only for parts of the parameter space depending on the relative strength of the stochastic and deterministic components. In contrast, we establish consistency and asymptotic normality of parameter estimates related to the stochastic component for the entire parameter space. Furthermore, the asymptotic distribution of the latter estimates is unaffected by the presence of the deterministic component, even when this is not consistently estimable. We also include Monte Carlo simulations to illustrate our results.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45592747","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Optional Pólya trees: Posterior rates and uncertainty quantification 可选Pólya树:后验率和不确定性量化
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs2086
I. Castillo, Thibault Randrianarisoa
{"title":"Optional Pólya trees: Posterior rates and uncertainty quantification","authors":"I. Castillo, Thibault Randrianarisoa","doi":"10.1214/22-ejs2086","DOIUrl":"https://doi.org/10.1214/22-ejs2086","url":null,"abstract":"","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42508643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Adaptive threshold-based classification of sparse high-dimensional data 基于自适应阈值的稀疏高维数据分类
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs1998
T. Pavlenko, N. Stepanova, Lee Thompson
Abstract: We revisit the problem of designing an efficient binary classifier in a challenging high-dimensional framework. The model under study assumes some local dependence structure among feature variables represented by a block-diagonal covariance matrix with a growing number of blocks of an arbitrary, but fixed size. The blocks correspond to non-overlapping independent groups of strongly correlated features. To assess the relevance of a particular block in predicting the response, we introduce a measure of “signal strength” pertaining to each feature block. This measure is then used to specify a sparse model of our interest. We further propose a threshold-based feature selector which operates as a screen-and-clean scheme integrated into a linear classifier: the data is subject to screening and hard threshold cleaning to filter out the blocks that contain no signals. Asymptotic properties of the proposed classifiers are studied when the sample size n depends on the number of feature blocks b, and the sample size goes to infinity with b at a slower rate than b. The new classifiers, which are fully adaptive to unknown parameters of the model, are shown to perform asymptotically optimally in a large part of the classification region. The numerical study confirms good analytical properties of the new classifiers that compare favorably to the existing threshold-based procedure used in a similar context.
摘要:我们重新审视了在一个具有挑战性的高维框架中设计一个高效的二进制分类器的问题。所研究的模型假设由块对角协方差矩阵表示的特征变量之间存在一些局部依赖结构,该矩阵具有不断增长的任意但固定大小的块。这些块对应于强相关特征的不重叠的独立组。为了评估特定块在预测响应中的相关性,我们引入了与每个特征块相关的“信号强度”度量。然后使用该度量来指定我们感兴趣的稀疏模型。我们进一步提出了一种基于阈值的特征选择器,它作为一种集成到线性分类器中的筛选和清理方案进行操作:对数据进行筛选和硬阈值清理,以过滤出不包含信号的块。当样本大小n取决于特征块b的数量,并且样本大小随b以比b慢的速率变为无穷大时,研究了所提出的分类器的渐近性质。新分类器完全自适应于模型的未知参数,在很大一部分分类区域中表现为渐近最优。数值研究证实了新分类器的良好分析性能,与在类似环境中使用的现有基于阈值的过程相比,这些分类器具有良好的分析性能。
{"title":"Adaptive threshold-based classification of sparse high-dimensional data","authors":"T. Pavlenko, N. Stepanova, Lee Thompson","doi":"10.1214/22-ejs1998","DOIUrl":"https://doi.org/10.1214/22-ejs1998","url":null,"abstract":"Abstract: We revisit the problem of designing an efficient binary classifier in a challenging high-dimensional framework. The model under study assumes some local dependence structure among feature variables represented by a block-diagonal covariance matrix with a growing number of blocks of an arbitrary, but fixed size. The blocks correspond to non-overlapping independent groups of strongly correlated features. To assess the relevance of a particular block in predicting the response, we introduce a measure of “signal strength” pertaining to each feature block. This measure is then used to specify a sparse model of our interest. We further propose a threshold-based feature selector which operates as a screen-and-clean scheme integrated into a linear classifier: the data is subject to screening and hard threshold cleaning to filter out the blocks that contain no signals. Asymptotic properties of the proposed classifiers are studied when the sample size n depends on the number of feature blocks b, and the sample size goes to infinity with b at a slower rate than b. The new classifiers, which are fully adaptive to unknown parameters of the model, are shown to perform asymptotically optimally in a large part of the classification region. The numerical study confirms good analytical properties of the new classifiers that compare favorably to the existing threshold-based procedure used in a similar context.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47611113","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimal estimation of the supremum and occupation times of a self-similar Lévy process 自相似Lévy过程的上确界和占用时间的最优估计
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/21-ejs1928
J. Ivanovs, M. Podolskij
In this paper we present new theoretical results on optimal estimation of certain random quantities based on high frequency observations of a Lévy process. More specifically, we investigate the asymptotic theory for the conditional mean and conditional median estimators of the supremum/infimum of a linear Brownian motion and a strictly stable Lévy process. Another contribution of our article is the conditional mean estimation of the local time and the occupation time of a linear Brownian motion. We demonstrate that the new estimators are considerably more efficient compared to the classical estimators studied in e.g. [6, 14, 29, 30, 38]. Furthermore, we discuss pre-estimation of the parameters of the underlying models, which is required for practical implementation of the proposed statistics. MSC2020 subject classifications: Primary 62M05, 62G20, 60F05; secondary 62G15, 60G18, 60G51.
在本文中,我们给出了基于Lévy过程的高频观测的某些随机量的最优估计的新的理论结果。更具体地说,我们研究了线性布朗运动和严格稳定Lévy过程的上确界/下确界的条件均值和条件中值估计的渐近理论。我们文章的另一个贡献是线性布朗运动的局部时间和占用时间的条件均值估计。我们证明,与[6,14,29,30,38]中研究的经典估计量相比,新的估计量要高效得多。此外,我们还讨论了基础模型参数的预估计,这是实际实现所提出的统计数据所必需的。MSC2020受试者分类:初级62M05、62G20、60F05;次级62G15、60G18、60G51。
{"title":"Optimal estimation of the supremum and occupation times of a self-similar Lévy process","authors":"J. Ivanovs, M. Podolskij","doi":"10.1214/21-ejs1928","DOIUrl":"https://doi.org/10.1214/21-ejs1928","url":null,"abstract":"In this paper we present new theoretical results on optimal estimation of certain random quantities based on high frequency observations of a Lévy process. More specifically, we investigate the asymptotic theory for the conditional mean and conditional median estimators of the supremum/infimum of a linear Brownian motion and a strictly stable Lévy process. Another contribution of our article is the conditional mean estimation of the local time and the occupation time of a linear Brownian motion. We demonstrate that the new estimators are considerably more efficient compared to the classical estimators studied in e.g. [6, 14, 29, 30, 38]. Furthermore, we discuss pre-estimation of the parameters of the underlying models, which is required for practical implementation of the proposed statistics. MSC2020 subject classifications: Primary 62M05, 62G20, 60F05; secondary 62G15, 60G18, 60G51.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43013455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Minimal σ-field for flexible sufficient dimension reduction 最小的σ-域为灵活的充分降维
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs1999
Hanmin Guo, Lin Hou, Y. Zhu
Sufficient Dimension Reduction (SDR) becomes an important tool for mitigating the curse of dimensionality in high dimensional regression analysis. Recently, Flexible SDR (FSDR) has been proposed to extend SDR by finding lower dimensional projections of transformed explanatory variables. The dimensions of the projections however cannot fully represent the extent of data reduction FSDR can achieve. As a consequence, optimality and other theoretical properties of FSDR are currently not well understood. In this article, we propose to use the σ-field associated with the projections, together with their dimensions to fully characterize FSDR, and refer to the σ-field as the FSDR σ-field. We further introduce the concept of minimal FSDR σ-field and consider FSDR projections with the minimal σfield optimal. Under some mild conditions, we show that the minimal FSDR σ-field exists, attaining the lowest dimensionality at the same time. To estimate the minimal FSDR σ-field, we propose a two-stage procedure called the Generalized Kernel Dimension Reduction (GKDR) method and partially establish its consistency property under weak conditions. Extensive simulation experiments demonstrate that the GKDRmethod can effectively find the minimal FSDR σ-field and outperform other existing methods. The application of GKDR to a real life air pollution data set sheds new light on the connections between atmospheric conditions and air quality. MSC2020 subject classifications: Primary 62B05; secondary 62J02.
充分降维(SDR)成为缓解高维回归分析中维数诅咒的重要工具。最近,柔性SDR(FSDR)被提出通过寻找转换的解释变量的低维投影来扩展SDR。然而,预测的维度不能完全代表FSDR可以实现的数据缩减程度。因此,FSDR的最优性和其他理论性质目前还没有得到很好的理解。在本文中,我们建议使用与投影相关的σ-场及其维度来完全表征FSDR,并将σ-场称为FSDRσ-场。我们进一步引入了最小FSDRσ场的概念,并考虑了具有最小σ场最优的FSDR投影。在一些温和的条件下,我们证明了最小FSDRσ-场的存在,同时达到了最低维。为了估计最小FSDRσ-场,我们提出了一种称为广义核降维(GKDR)方法的两阶段过程,并在弱条件下部分建立了它的一致性性质。大量的仿真实验表明,GKDR方法能够有效地找到最小FSDRσ场,并且优于现有的其他方法。GKDR在现实生活中的空气污染数据集中的应用为大气条件和空气质量之间的联系提供了新的线索。MSC2020受试者分类:初级62B05;次级62J02。
{"title":"Minimal σ-field for flexible sufficient dimension reduction","authors":"Hanmin Guo, Lin Hou, Y. Zhu","doi":"10.1214/22-ejs1999","DOIUrl":"https://doi.org/10.1214/22-ejs1999","url":null,"abstract":"Sufficient Dimension Reduction (SDR) becomes an important tool for mitigating the curse of dimensionality in high dimensional regression analysis. Recently, Flexible SDR (FSDR) has been proposed to extend SDR by finding lower dimensional projections of transformed explanatory variables. The dimensions of the projections however cannot fully represent the extent of data reduction FSDR can achieve. As a consequence, optimality and other theoretical properties of FSDR are currently not well understood. In this article, we propose to use the σ-field associated with the projections, together with their dimensions to fully characterize FSDR, and refer to the σ-field as the FSDR σ-field. We further introduce the concept of minimal FSDR σ-field and consider FSDR projections with the minimal σfield optimal. Under some mild conditions, we show that the minimal FSDR σ-field exists, attaining the lowest dimensionality at the same time. To estimate the minimal FSDR σ-field, we propose a two-stage procedure called the Generalized Kernel Dimension Reduction (GKDR) method and partially establish its consistency property under weak conditions. Extensive simulation experiments demonstrate that the GKDRmethod can effectively find the minimal FSDR σ-field and outperform other existing methods. The application of GKDR to a real life air pollution data set sheds new light on the connections between atmospheric conditions and air quality. MSC2020 subject classifications: Primary 62B05; secondary 62J02.","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46459761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation of partially conditional average treatment effect by double kernel-covariate balancing 用双核协变量平衡估计部分条件平均治疗效果
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs2000
Jiayi Wang, R. K. Wong, Shu Yang, K. C. G. Chan
{"title":"Estimation of partially conditional average treatment effect by double kernel-covariate balancing","authors":"Jiayi Wang, R. K. Wong, Shu Yang, K. C. G. Chan","doi":"10.1214/22-ejs2000","DOIUrl":"https://doi.org/10.1214/22-ejs2000","url":null,"abstract":"","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49422993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Isotonic regression for elicitable functionals and their Bayes risk 可引出泛函的同调回归及其Bayes风险
IF 1.1 4区 数学 Q3 STATISTICS & PROBABILITY Pub Date : 2022-01-01 DOI: 10.1214/22-ejs2034
Anja Mühlemann, Johanna F. Ziegel
{"title":"Isotonic regression for elicitable functionals and their Bayes risk","authors":"Anja Mühlemann, Johanna F. Ziegel","doi":"10.1214/22-ejs2034","DOIUrl":"https://doi.org/10.1214/22-ejs2034","url":null,"abstract":"","PeriodicalId":49272,"journal":{"name":"Electronic Journal of Statistics","volume":" ","pages":""},"PeriodicalIF":1.1,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44276319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Electronic Journal of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1