首页 > 最新文献

The Annals of Statistics最新文献

英文 中文
Optimal difference-based variance estimators in time series: A general framework 时间序列中基于差分的最优方差估计:一般框架
Pub Date : 2021-12-30 DOI: 10.1214/21-aos2154
Kin Wai Chan
Variance estimation is important for statistical inference. It becomes nontrivial when observations are masked by serial dependence structures and time-varying mean structures. Existing methods either ignore or sub-optimally handle these nuisance structures. This paper develops a general framework for the estimation of the long-run variance for time series with nonconstant means. The building blocks are difference statistics. The proposed class of estimators is general enough to cover many existing estimators. Necessary and sufficient conditions for consistency are investigated. The first asymptotically optimal estimator is derived. Our proposed estimator is the-oretically proven to be invariant to arbitrary mean structures, which may include trends and a possibly divergent number of discontinuities.
方差估计对统计推断非常重要。当观测值被序列依赖结构和时变平均结构所掩盖时,它就变得不平凡了。现有的方法要么忽略这些讨厌的结构,要么处理得不够理想。本文提出了非常均值时间序列长期方差估计的一般框架。构建模块是差异统计。所提出的估计器类是足够通用的,可以涵盖许多现有的估计器。研究了一致性的充分必要条件。导出了第一个渐近最优估计量。我们提出的估计量在理论上被证明对任意平均结构是不变的,这些平均结构可能包括趋势和可能分散的不连续数。
{"title":"Optimal difference-based variance estimators in time series: A general framework","authors":"Kin Wai Chan","doi":"10.1214/21-aos2154","DOIUrl":"https://doi.org/10.1214/21-aos2154","url":null,"abstract":"Variance estimation is important for statistical inference. It becomes nontrivial when observations are masked by serial dependence structures and time-varying mean structures. Existing methods either ignore or sub-optimally handle these nuisance structures. This paper develops a general framework for the estimation of the long-run variance for time series with nonconstant means. The building blocks are difference statistics. The proposed class of estimators is general enough to cover many existing estimators. Necessary and sufficient conditions for consistency are investigated. The first asymptotically optimal estimator is derived. Our proposed estimator is the-oretically proven to be invariant to arbitrary mean structures, which may include trends and a possibly divergent number of discontinuities.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"43 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84555122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Total positivity in multivariate extremes 多元极值中的总正性
Pub Date : 2021-12-29 DOI: 10.1214/23-aos2272
Frank Rottger, Sebastian Engelke, Piotr Zwiernik
Positive dependence is present in many real world data sets and has appealing stochastic properties that can be exploited in statistical modeling and in estimation. In particular, the notion of multivariate total positivity of order 2 ($ mathrm{MTP}_{2} $) is a convex constraint and acts as an implicit regularizer in the Gaussian case. We study positive dependence in multivariate extremes and introduce $ mathrm{EMTP}_{2} $, an extremal version of $ mathrm{MTP}_{2} $. This notion turns out to appear prominently in extremes, and in fact, it is satisfied by many classical models. For a H"usler--Reiss distribution, the analogue of a Gaussian distribution in extremes, we show that it is $ mathrm{EMTP}_{2} $ if and only if its precision matrix is a Laplacian of a connected graph. We propose an estimator for the parameters of the H"usler--Reiss distribution under $ mathrm{EMTP}_{2} $ as the solution of a convex optimization problem with Laplacian constraint. We prove that this estimator is consistent and typically yields a sparse model with possibly nondecomposable extremal graphical structure. Applying our methods to a data set of Danube River flows, we illustrate this regularization and the superior performance compared to existing methods.
正相关性存在于许多真实世界的数据集中,并且具有可用于统计建模和估计的吸引人的随机特性。特别地,2阶($ mathm {MTP}_{2} $)的多元全正性的概念是一个凸约束,在高斯情况下充当隐式正则化器。我们研究了多元极值的正相关性,并引入了$ mathm {EMTP}_{2} $的极值版本$ mathm {MTP}_{2} $。这个概念在极端情况下很明显,事实上,它被许多经典模型所满足。对于一个H usler—Reiss分布,一个极值高斯分布的模拟,我们证明了它是$ mathm {EMTP}_{2} $当且仅当它的精度矩阵是连通图的拉普拉斯矩阵。我们提出了$ mathm {EMTP}_{2} $条件下H usler—Reiss分布参数的一个估计量,作为具有拉普拉斯约束的凸优化问题的解。我们证明了这个估计量是一致的,并且通常会得到一个可能具有不可分解的极值图结构的稀疏模型。将我们的方法应用于多瑙河流量的数据集,我们说明了这种正则化和与现有方法相比优越的性能。
{"title":"Total positivity in multivariate extremes","authors":"Frank Rottger, Sebastian Engelke, Piotr Zwiernik","doi":"10.1214/23-aos2272","DOIUrl":"https://doi.org/10.1214/23-aos2272","url":null,"abstract":"Positive dependence is present in many real world data sets and has appealing stochastic properties that can be exploited in statistical modeling and in estimation. In particular, the notion of multivariate total positivity of order 2 ($ mathrm{MTP}_{2} $) is a convex constraint and acts as an implicit regularizer in the Gaussian case. We study positive dependence in multivariate extremes and introduce $ mathrm{EMTP}_{2} $, an extremal version of $ mathrm{MTP}_{2} $. This notion turns out to appear prominently in extremes, and in fact, it is satisfied by many classical models. For a H\"usler--Reiss distribution, the analogue of a Gaussian distribution in extremes, we show that it is $ mathrm{EMTP}_{2} $ if and only if its precision matrix is a Laplacian of a connected graph. We propose an estimator for the parameters of the H\"usler--Reiss distribution under $ mathrm{EMTP}_{2} $ as the solution of a convex optimization problem with Laplacian constraint. We prove that this estimator is consistent and typically yields a sparse model with possibly nondecomposable extremal graphical structure. Applying our methods to a data set of Danube River flows, we illustrate this regularization and the superior performance compared to existing methods.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"9 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82875264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Local permutation tests for conditional independence 条件独立性的局部排列检验
Pub Date : 2021-12-22 DOI: 10.1214/22-aos2233
Ilmun Kim, Matey Neykov, Sivaraman Balakrishnan, L. Wasserman
In this paper, we investigate local permutation tests for testing conditional independence between two random vectors X and Y given Z . The local permutation test determines the significance of a test statistic by locally shuffling samples which share similar values of the conditioning variables Z , and it forms a natural extension of the usual permutation approach for unconditional independence testing. Despite its simplicity and empirical support, the theoretical underpinnings of the local permutation test remain unclear. Motivated by this gap, this paper aims to establish theoretical foundations of local permutation tests with a particular focus on binning-based statistics. We start by revisiting the hardness of conditional independence testing and provide an upper bound for the power of any valid conditional independence test, which holds when the probability of observing “collisions” in Z is small. This negative result naturally motivates us to impose additional restrictions on the possible distributions under the null and alternate. To this end, we focus our attention on certain classes of smooth distributions and identify provably tight conditions under which the local permutation method is universally valid, i.e. it is valid when applied to any (binning-based) test statistic. To complement this result on type I error control, we also show that in some cases, a binning-based statistic calibrated via the local permutation method can achieve minimax optimal power. We also introduce a double-binning permutation strategy, which yields a valid test over less smooth null distributions than the typical single-binning method without compromising much power. Finally, we present simulation results to support our theoretical
在给定Z的情况下,研究了检验两个随机向量X和Y之间条件独立性的局部置换检验。局部置换检验通过对条件变量Z值相近的样本进行局部洗牌来确定检验统计量的显著性,是对通常的无条件独立性检验的置换方法的自然扩展。尽管它的简单性和经验支持,局部排列检验的理论基础仍然不清楚。基于这一差距,本文旨在建立局部排列检验的理论基础,并特别关注基于分类的统计。我们首先重新审视条件独立测试的难度,并为任何有效的条件独立测试的幂提供一个上界,当观察到Z中的“碰撞”概率很小时,这个上界成立。这个消极的结果自然促使我们对null和alternate下的可能分布施加额外的限制。为此,我们将注意力集中在光滑分布的某些类别上,并确定局部排列方法普遍有效的可证明紧条件,即当应用于任何(基于binning的)检验统计量时,它都是有效的。为了在I型误差控制上补充这一结果,我们还表明,在某些情况下,通过局部排列方法校准的基于分类的统计量可以达到最小最大最优功率。我们还引入了一种双分箱排列策略,它比典型的单分箱方法在不太光滑的零分布上产生有效的测试,而不会牺牲太多的功率。最后,我们给出了仿真结果来支持我们的理论
{"title":"Local permutation tests for conditional independence","authors":"Ilmun Kim, Matey Neykov, Sivaraman Balakrishnan, L. Wasserman","doi":"10.1214/22-aos2233","DOIUrl":"https://doi.org/10.1214/22-aos2233","url":null,"abstract":"In this paper, we investigate local permutation tests for testing conditional independence between two random vectors X and Y given Z . The local permutation test determines the significance of a test statistic by locally shuffling samples which share similar values of the conditioning variables Z , and it forms a natural extension of the usual permutation approach for unconditional independence testing. Despite its simplicity and empirical support, the theoretical underpinnings of the local permutation test remain unclear. Motivated by this gap, this paper aims to establish theoretical foundations of local permutation tests with a particular focus on binning-based statistics. We start by revisiting the hardness of conditional independence testing and provide an upper bound for the power of any valid conditional independence test, which holds when the probability of observing “collisions” in Z is small. This negative result naturally motivates us to impose additional restrictions on the possible distributions under the null and alternate. To this end, we focus our attention on certain classes of smooth distributions and identify provably tight conditions under which the local permutation method is universally valid, i.e. it is valid when applied to any (binning-based) test statistic. To complement this result on type I error control, we also show that in some cases, a binning-based statistic calibrated via the local permutation method can achieve minimax optimal power. We also introduce a double-binning permutation strategy, which yields a valid test over less smooth null distributions than the typical single-binning method without compromising much power. Finally, we present simulation results to support our theoretical","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"62 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85565845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
The integrated copula spectrum 积分共轭谱
Pub Date : 2021-12-14 DOI: 10.1214/22-AOS2240
Yuichi Goto, Tobias Kley, Ria Van Hecke, S. Volgushev, H. Dette, M. Hallin
Frequency domain methods form a ubiquitous part of the statistical tool-box for time series analysis. In recent years, considerable interest has been given to the development of new spectral methodology and tools capturing dynamics in the entire joint distributions and thus avoiding the limitations of classical, L 2 -based spectral methods. Most of the spectral concepts proposed in that literature suffer from one major drawback, though: their estimation re-quires the choice of a smoothing parameter, which has a considerable impact on estimation quality and poses challenges for statistical inference. In this paper, associated with the concept of copula-based spectrum, we introduce the notion of copula spectral distribution function or integrated copula spectrum . This integrated copula spectrum retains the advantages of copula-based spectra but can be estimated without the need for smoothing parameters. We provide such estimators, along with a thorough theoretical analysis, based on a functional central limit theorem, of their asymptotic properties. We leverage these results to test various hypotheses that cannot be addressed by classical spectral methods, such as the lack of time-reversibility or asymmetry in tail dynamics.
频域方法构成了时间序列分析统计工具箱中无处不在的一部分。近年来,人们对开发新的光谱方法和工具产生了相当大的兴趣,这些方法和工具可以捕获整个联合分布中的动态,从而避免经典的基于l2的光谱方法的局限性。这些文献中提出的大多数光谱概念都有一个主要缺点:它们的估计需要选择平滑参数,这对估计质量有相当大的影响,并对统计推断提出了挑战。本文结合基于联结谱的概念,引入联结谱分布函数或积分联结谱的概念。这种集成的copula谱保留了基于copula谱的优点,但不需要平滑参数即可进行估计。我们提供了这样的估计量,并基于泛函中心极限定理对它们的渐近性质进行了彻底的理论分析。我们利用这些结果来测试经典光谱方法无法解决的各种假设,例如缺乏时间可逆性或尾部动力学的不对称性。
{"title":"The integrated copula spectrum","authors":"Yuichi Goto, Tobias Kley, Ria Van Hecke, S. Volgushev, H. Dette, M. Hallin","doi":"10.1214/22-AOS2240","DOIUrl":"https://doi.org/10.1214/22-AOS2240","url":null,"abstract":"Frequency domain methods form a ubiquitous part of the statistical tool-box for time series analysis. In recent years, considerable interest has been given to the development of new spectral methodology and tools capturing dynamics in the entire joint distributions and thus avoiding the limitations of classical, L 2 -based spectral methods. Most of the spectral concepts proposed in that literature suffer from one major drawback, though: their estimation re-quires the choice of a smoothing parameter, which has a considerable impact on estimation quality and poses challenges for statistical inference. In this paper, associated with the concept of copula-based spectrum, we introduce the notion of copula spectral distribution function or integrated copula spectrum . This integrated copula spectrum retains the advantages of copula-based spectra but can be estimated without the need for smoothing parameters. We provide such estimators, along with a thorough theoretical analysis, based on a functional central limit theorem, of their asymptotic properties. We leverage these results to test various hypotheses that cannot be addressed by classical spectral methods, such as the lack of time-reversibility or asymmetry in tail dynamics.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"38 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76479775","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of generalized Bregman surrogate algorithms for nonsmooth nonconvex statistical learning 非光滑非凸统计学习的广义Bregman代理算法分析
Pub Date : 2021-12-01 DOI: 10.1214/21-aos2090
Yiyuan She, Zhifeng Wang, Jiuwu Jin
Modern statistical applications often involve minimizing an objective function that may be nonsmooth and/or nonconvex. This paper focuses on a broad Bregman-surrogate algorithm framework including the local linear approximation, mirror descent, iterative thresholding, DC programming and many others as particular instances. The re-characterization via generalized Bregman functions enables us to construct suitable error measures and establish global convergence rates for nonconvex and nonsmooth objectives in possibly high dimensions. For sparse learning problems with a composite objective, under some regularity conditions, the obtained estimators as the surrogate’s fixed points, though not necessarily local minimizers, enjoy provable statistical guarantees, and the sequence of iterates can be shown to approach the statistical truth within the desired accuracy geometrically fast. The paper also studies how to design adaptive momentum based accelerations without assuming convexity or smoothness by carefully controlling stepsize and relaxation parameters.
现代统计应用通常涉及最小化可能是非光滑和/或非凸的目标函数。本文重点讨论了广义的bregman - proxy算法框架,包括局部线性逼近、镜像下降、迭代阈值分割、DC规划和许多其他具体实例。通过广义Bregman函数的重新表征使我们能够构建合适的误差度量并建立可能高维的非凸和非光滑目标的全局收敛率。对于具有复合目标的稀疏学习问题,在一定的正则性条件下,得到的估计量作为代理的不动点,虽然不一定是局部极小值,但具有可证明的统计保证,并且迭代序列可以在几何上快速地接近所需精度的统计真值。本文还研究了如何通过仔细控制步长和松弛参数来设计基于动量的自适应加速度,而不假设其凹凸性或光滑性。
{"title":"Analysis of generalized Bregman surrogate algorithms for nonsmooth nonconvex statistical learning","authors":"Yiyuan She, Zhifeng Wang, Jiuwu Jin","doi":"10.1214/21-aos2090","DOIUrl":"https://doi.org/10.1214/21-aos2090","url":null,"abstract":"Modern statistical applications often involve minimizing an objective function that may be nonsmooth and/or nonconvex. This paper focuses on a broad Bregman-surrogate algorithm framework including the local linear approximation, mirror descent, iterative thresholding, DC programming and many others as particular instances. The re-characterization via generalized Bregman functions enables us to construct suitable error measures and establish global convergence rates for nonconvex and nonsmooth objectives in possibly high dimensions. For sparse learning problems with a composite objective, under some regularity conditions, the obtained estimators as the surrogate’s fixed points, though not necessarily local minimizers, enjoy provable statistical guarantees, and the sequence of iterates can be shown to approach the statistical truth within the desired accuracy geometrically fast. The paper also studies how to design adaptive momentum based accelerations without assuming convexity or smoothness by carefully controlling stepsize and relaxation parameters.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"509 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76404726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
On fixed-domain asymptotics, parameter estimation and isotropic Gaussian random fields with Matérn covariance functions 关于固定域渐近性、参数估计和各向同性高斯随机场的mat<s:1>协方差函数
Pub Date : 2021-12-01 DOI: 10.1214/21-aos2077
Wei-Liem Loh, Saifei Sun, J. Wen
{"title":"On fixed-domain asymptotics, parameter estimation and isotropic Gaussian random fields with Matérn covariance functions","authors":"Wei-Liem Loh, Saifei Sun, J. Wen","doi":"10.1214/21-aos2077","DOIUrl":"https://doi.org/10.1214/21-aos2077","url":null,"abstract":"","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"9 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82034007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Optimal linear discriminators for the discrete choice model in growing dimensions 增长维数下离散选择模型的最优线性判别器
Pub Date : 2021-12-01 DOI: 10.1214/21-aos2085
Debarghya Mukherjee, M. Banerjee
Manski’s celebrated maximum score estimator for the discrete choice model, which is an optimal linear discriminator, has been the focus of much investigation in both the econometrics and statistics literatures, but its behavior under growing dimension scenarios largely remains unknown. This paper addresses that gap. Two different cases are considered: p grows with n but at a slow rate, i.e. p/n→ 0; and p n (fast growth). In the binary response model, we recast Manski’s score estimation as empirical risk minimization for a classification problem, and derive the `2 rate of convergence of the score estimator under a new transition condition in terms of a margin parameter that calibrates the level of difficulty of the estimation problem. We also establish upper and lower bounds for the minimax `2 error in the binary choice model that differ by a logarithmic factor, and construct a minimax-optimal estimator in the slow growth regime. Some extensions to the multinomial choice model are also considered.
Manski著名的离散选择模型的最大分数估计器是一种最优线性判别器,在计量经济学和统计学文献中一直是许多研究的焦点,但它在增长维场景下的行为在很大程度上仍然未知。本文解决了这一差距。考虑两种不同的情况:p随n增长,但速度缓慢,即p/n→0;pn(快速增长)在二元响应模型中,我们将Manski的分数估计重新定义为分类问题的经验风险最小化,并根据校准估计问题难易程度的余量参数导出了分数估计器在新的过渡条件下的' 2收敛率。我们还建立了二元选择模型中存在一个对数因子差异的最小最大2误差的上界和下界,并构造了慢增长条件下的最小最优估计量。本文还考虑了多项选择模型的一些扩展。
{"title":"Optimal linear discriminators for the discrete choice model in growing dimensions","authors":"Debarghya Mukherjee, M. Banerjee","doi":"10.1214/21-aos2085","DOIUrl":"https://doi.org/10.1214/21-aos2085","url":null,"abstract":"Manski’s celebrated maximum score estimator for the discrete choice model, which is an optimal linear discriminator, has been the focus of much investigation in both the econometrics and statistics literatures, but its behavior under growing dimension scenarios largely remains unknown. This paper addresses that gap. Two different cases are considered: p grows with n but at a slow rate, i.e. p/n→ 0; and p n (fast growth). In the binary response model, we recast Manski’s score estimation as empirical risk minimization for a classification problem, and derive the `2 rate of convergence of the score estimator under a new transition condition in terms of a margin parameter that calibrates the level of difficulty of the estimation problem. We also establish upper and lower bounds for the minimax `2 error in the binary choice model that differ by a logarithmic factor, and construct a minimax-optimal estimator in the slow growth regime. Some extensions to the multinomial choice model are also considered.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"29 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80166810","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Posterior analysis of n in the binomial (n,p) problem with both parameters unknown—with applications to quantitative nanoscopy 两个参数都未知的二项(n,p)问题中n的后验分析-应用于定量纳米显微镜
Pub Date : 2021-12-01 DOI: 10.1214/21-aos2096
Johannes Schmidt-Hieber, Laura Fee Schneider, Thomas Staudt, A. Krajina, Timo Aspelmeier, Axel Munk
Estimation of the population size n from k i.i
从k。i。i估计总体大小n
{"title":"Posterior analysis of n in the binomial (n,p) problem with both parameters unknown—with applications to quantitative nanoscopy","authors":"Johannes Schmidt-Hieber, Laura Fee Schneider, Thomas Staudt, A. Krajina, Timo Aspelmeier, Axel Munk","doi":"10.1214/21-aos2096","DOIUrl":"https://doi.org/10.1214/21-aos2096","url":null,"abstract":"Estimation of the population size n from k i.i","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"64 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80336383","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Rate-optimal cluster-randomized designs for spatial interference 空间干扰率最优聚类随机设计
Pub Date : 2021-11-08 DOI: 10.1214/22-aos2224
Michael P. Leung
We consider a potential outcomes model in which interference may be present between any two units but the extent of interference diminishes with spatial distance. The causal estimand is the global average treatment effect, which compares counterfactual outcomes when all units are treated to those when none are. We study a class of designs in which space is partitioned into clusters that are randomized into treatment and control. For each design, we estimate the treatment effect using a Horvitz-Thompson estimator that compares the average outcomes of units with all neighbors treated to units with no treated neighbors, where the neighborhood radius is of the same order as the cluster size dictated by the design. We derive the estimator’s rate of convergence as a function of the design and degree of interference and use this to obtain estimator-design pairs that achieve near-optimal rates of convergence under relatively minimal assumptions on interference. We prove that the estimators are asymptotically normal and provide a variance estimator. For practical implementation of the designs, we suggest partitioning space using clustering algorithms. only be directly observed in the data under an extreme design that assigns all units to the same treatment arm, which would necessarily preclude observation of the other counterfactual. Common designs used in the literature, including those studied here, assign different units to different treatment arms, so neither average is directly observed in the data. Nonetheless, we show that asymptotic inference on θ n is possible for a class of cluster-randomized designs under spatial interference where the degree of interference diminishes with distance.
我们考虑了一个潜在的结果模型,其中任何两个单元之间都可能存在干扰,但干扰的程度随着空间距离而减小。因果估计是全球平均治疗效果,将所有单位都接受治疗时的反事实结果与没有接受治疗时的反事实结果进行比较。我们研究了一类设计,其中空间被划分为随机分组,分为实验组和对照组。对于每个设计,我们使用Horvitz-Thompson估计器来估计处理效果,该估计器将所有邻居处理的单元的平均结果与未处理邻居的单元进行比较,其中邻居半径与设计规定的簇大小具有相同的顺序。我们推导了估计器的收敛速度作为设计和干扰程度的函数,并使用它来获得在相对最小的干扰假设下实现近最优收敛速度的估计器-设计对。我们证明了这些估计量是渐近正态的,并给出了方差估计量。对于设计的实际实现,我们建议使用聚类算法划分空间。只有在将所有单位分配到相同处理臂的极端设计下,才能直接观察到数据,这必然会排除对其他反事实的观察。文献中使用的常见设计,包括本文研究的设计,将不同的单位分配到不同的治疗组,因此在数据中没有直接观察到平均值。尽管如此,我们证明了在干扰程度随距离减小的空间干扰下,一类簇随机设计对θ n的渐近推断是可能的。
{"title":"Rate-optimal cluster-randomized designs for spatial interference","authors":"Michael P. Leung","doi":"10.1214/22-aos2224","DOIUrl":"https://doi.org/10.1214/22-aos2224","url":null,"abstract":"We consider a potential outcomes model in which interference may be present between any two units but the extent of interference diminishes with spatial distance. The causal estimand is the global average treatment effect, which compares counterfactual outcomes when all units are treated to those when none are. We study a class of designs in which space is partitioned into clusters that are randomized into treatment and control. For each design, we estimate the treatment effect using a Horvitz-Thompson estimator that compares the average outcomes of units with all neighbors treated to units with no treated neighbors, where the neighborhood radius is of the same order as the cluster size dictated by the design. We derive the estimator’s rate of convergence as a function of the design and degree of interference and use this to obtain estimator-design pairs that achieve near-optimal rates of convergence under relatively minimal assumptions on interference. We prove that the estimators are asymptotically normal and provide a variance estimator. For practical implementation of the designs, we suggest partitioning space using clustering algorithms. only be directly observed in the data under an extreme design that assigns all units to the same treatment arm, which would necessarily preclude observation of the other counterfactual. Common designs used in the literature, including those studied here, assign different units to different treatment arms, so neither average is directly observed in the data. Nonetheless, we show that asymptotic inference on θ n is possible for a class of cluster-randomized designs under spatial interference where the degree of interference diminishes with distance.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"60 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84708981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
On the disjoint and sliding block maxima method for piecewise stationary time series 分段平稳时间序列的不相交滑动块极大值法
Pub Date : 2021-10-29 DOI: 10.1214/23-aos2260
Axel Bucher, L. Zanger
Modeling univariate block maxima by the generalized extreme value distribution constitutes one of the most widely applied approaches in extreme value statistics. It has recently been found that, for an underlying stationary time series, respective estimators may be improved by calculating block maxima in an overlapping way. A proof of concept is provided that the latter finding also holds in situations that involve certain piecewise stationarities. A weak convergence result for an empirical process of central interest is provided, and, as a case-in-point, further details are worked out explicitly for the probability weighted moment estimator. Irrespective of the serial dependence, the estimation variance is shown to be smaller for the new estimator, while the bias was found to be the same or vary comparably little in extensive simulation experiments. The results are illustrated by Monte Carlo simulation experiments and are applied to a common situation involving temperature extremes in a changing climate.
利用广义极值分布对单变量块极大值进行建模是极值统计中应用最广泛的方法之一。最近发现,对于底层平稳时间序列,可以通过以重叠的方式计算块最大值来改进各自的估计量。提供了概念证明,后者的发现也适用于涉及某些分段平稳性的情况。提供了一个中心兴趣的经验过程的弱收敛结果,并且作为一个实例,明确地为概率加权矩估计器制定了进一步的细节。无论序列依赖性如何,新估计器的估计方差较小,而在广泛的模拟实验中发现偏差相同或变化相对较小。结果通过蒙特卡罗模拟实验加以说明,并应用于气候变化中涉及极端温度的常见情况。
{"title":"On the disjoint and sliding block maxima method for piecewise stationary time series","authors":"Axel Bucher, L. Zanger","doi":"10.1214/23-aos2260","DOIUrl":"https://doi.org/10.1214/23-aos2260","url":null,"abstract":"Modeling univariate block maxima by the generalized extreme value distribution constitutes one of the most widely applied approaches in extreme value statistics. It has recently been found that, for an underlying stationary time series, respective estimators may be improved by calculating block maxima in an overlapping way. A proof of concept is provided that the latter finding also holds in situations that involve certain piecewise stationarities. A weak convergence result for an empirical process of central interest is provided, and, as a case-in-point, further details are worked out explicitly for the probability weighted moment estimator. Irrespective of the serial dependence, the estimation variance is shown to be smaller for the new estimator, while the bias was found to be the same or vary comparably little in extensive simulation experiments. The results are illustrated by Monte Carlo simulation experiments and are applied to a common situation involving temperature extremes in a changing climate.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"90 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79403685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
The Annals of Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1