{"title":"Lp损失函数下多元定位的仿射等变推理","authors":"A. Dürre, D. Paindaveine","doi":"10.1214/22-aos2199","DOIUrl":null,"url":null,"abstract":"We consider the fundamental problem of estimating the location of a d -variate probability measure under an L p loss function. The naive estimator, that minimizes the usual empirical L p risk, has a known asymptotic behavior but suffers from several deficiencies for p (cid:2)= 2, the most important one being the lack of equivariance under general affine transformations. In this work, we introduce a collection of L p location estimators ˆ μ p,(cid:2)n that minimize the size of suitable (cid:2) -dimensional data-based simplices. For (cid:2) = 1, these estimators reduce to the naive ones, whereas, for (cid:2) = d , they are equivariant under affine transformations. Irrespective of (cid:2) , these estimators reduce to the sample mean for p = 2, whereas for p = 1, the estimators provide the well-known spatial median and Oja median for (cid:2) = 1 and (cid:2) = d , respectively. Under very mild assumptions, we derive an explicit Bahadur representation result for ˆ μ p,(cid:2)n and establish asymptotic normality. We prove that, quite remarkably, the asymptotic behavior of the estimators does not depend on (cid:2) under spherical symmetry, so that the affine equivariance for (cid:2) = d is achieved at no cost in terms of efficiency. To allow for large sample size n and/or large dimension d , we introduce a version of our estimators relying on incomplete U-statistics. Under a centro-symmetry assumption, we also define companion tests φ p,(cid:2)n for the problem of testing the null hypothesis that the location μ of the underlying probability measure coincides with a given location μ 0 . For any p , affine invariance is achieved for (cid:2) = d . For any (cid:2) and p , we derive explicit expressions for the asymptotic power of these tests under contiguous local alternatives, which reveals that asymptotic relative efficiencies with respect to traditional parametric Gaussian procedures for hypothesis testing coincide with those obtained for point estimation. We illustrate finite-sample relevance of our asymptotic results through Monte Carlo exercises and also treat a real data example.","PeriodicalId":22375,"journal":{"name":"The Annals of Statistics","volume":"73 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Affine-equivariant inference for multivariate location under Lp loss functions\",\"authors\":\"A. Dürre, D. Paindaveine\",\"doi\":\"10.1214/22-aos2199\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider the fundamental problem of estimating the location of a d -variate probability measure under an L p loss function. The naive estimator, that minimizes the usual empirical L p risk, has a known asymptotic behavior but suffers from several deficiencies for p (cid:2)= 2, the most important one being the lack of equivariance under general affine transformations. In this work, we introduce a collection of L p location estimators ˆ μ p,(cid:2)n that minimize the size of suitable (cid:2) -dimensional data-based simplices. For (cid:2) = 1, these estimators reduce to the naive ones, whereas, for (cid:2) = d , they are equivariant under affine transformations. Irrespective of (cid:2) , these estimators reduce to the sample mean for p = 2, whereas for p = 1, the estimators provide the well-known spatial median and Oja median for (cid:2) = 1 and (cid:2) = d , respectively. Under very mild assumptions, we derive an explicit Bahadur representation result for ˆ μ p,(cid:2)n and establish asymptotic normality. We prove that, quite remarkably, the asymptotic behavior of the estimators does not depend on (cid:2) under spherical symmetry, so that the affine equivariance for (cid:2) = d is achieved at no cost in terms of efficiency. To allow for large sample size n and/or large dimension d , we introduce a version of our estimators relying on incomplete U-statistics. Under a centro-symmetry assumption, we also define companion tests φ p,(cid:2)n for the problem of testing the null hypothesis that the location μ of the underlying probability measure coincides with a given location μ 0 . For any p , affine invariance is achieved for (cid:2) = d . For any (cid:2) and p , we derive explicit expressions for the asymptotic power of these tests under contiguous local alternatives, which reveals that asymptotic relative efficiencies with respect to traditional parametric Gaussian procedures for hypothesis testing coincide with those obtained for point estimation. We illustrate finite-sample relevance of our asymptotic results through Monte Carlo exercises and also treat a real data example.\",\"PeriodicalId\":22375,\"journal\":{\"name\":\"The Annals of Statistics\",\"volume\":\"73 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"The Annals of Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1214/22-aos2199\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Annals of Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1214/22-aos2199","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Affine-equivariant inference for multivariate location under Lp loss functions
We consider the fundamental problem of estimating the location of a d -variate probability measure under an L p loss function. The naive estimator, that minimizes the usual empirical L p risk, has a known asymptotic behavior but suffers from several deficiencies for p (cid:2)= 2, the most important one being the lack of equivariance under general affine transformations. In this work, we introduce a collection of L p location estimators ˆ μ p,(cid:2)n that minimize the size of suitable (cid:2) -dimensional data-based simplices. For (cid:2) = 1, these estimators reduce to the naive ones, whereas, for (cid:2) = d , they are equivariant under affine transformations. Irrespective of (cid:2) , these estimators reduce to the sample mean for p = 2, whereas for p = 1, the estimators provide the well-known spatial median and Oja median for (cid:2) = 1 and (cid:2) = d , respectively. Under very mild assumptions, we derive an explicit Bahadur representation result for ˆ μ p,(cid:2)n and establish asymptotic normality. We prove that, quite remarkably, the asymptotic behavior of the estimators does not depend on (cid:2) under spherical symmetry, so that the affine equivariance for (cid:2) = d is achieved at no cost in terms of efficiency. To allow for large sample size n and/or large dimension d , we introduce a version of our estimators relying on incomplete U-statistics. Under a centro-symmetry assumption, we also define companion tests φ p,(cid:2)n for the problem of testing the null hypothesis that the location μ of the underlying probability measure coincides with a given location μ 0 . For any p , affine invariance is achieved for (cid:2) = d . For any (cid:2) and p , we derive explicit expressions for the asymptotic power of these tests under contiguous local alternatives, which reveals that asymptotic relative efficiencies with respect to traditional parametric Gaussian procedures for hypothesis testing coincide with those obtained for point estimation. We illustrate finite-sample relevance of our asymptotic results through Monte Carlo exercises and also treat a real data example.