{"title":"基于最小脊协方差行列式估计的离群点检测","authors":"Chikun Li, B. Jin, Yuehua Wu","doi":"10.5705/ss.202022.0142","DOIUrl":null,"url":null,"abstract":": In this paper, we propose an outlier detection procedure, based on a high-breakdown minimum ridge covariance determinant estimator that is especially useful for the large p/n scenario. The estimator is obtained from the subset of observations, after excluding potential outliers, by applying the so-called concentration steps. We explore the asymptotic distribution of the modified Mahalanobis distance related to the proposed estimator under certain moment conditions, and obtain a theoretical cutoff value for outlier identification. We also improve the outlier detection power by adding a one-step reweighting procedure. Lastly, we investigate the performance of the proposed methods using simulations and a real-data analysis.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"1 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Outlier Detection via a Minimum Ridge Covariance Determinant Estimator\",\"authors\":\"Chikun Li, B. Jin, Yuehua Wu\",\"doi\":\"10.5705/ss.202022.0142\",\"DOIUrl\":null,\"url\":null,\"abstract\":\": In this paper, we propose an outlier detection procedure, based on a high-breakdown minimum ridge covariance determinant estimator that is especially useful for the large p/n scenario. The estimator is obtained from the subset of observations, after excluding potential outliers, by applying the so-called concentration steps. We explore the asymptotic distribution of the modified Mahalanobis distance related to the proposed estimator under certain moment conditions, and obtain a theoretical cutoff value for outlier identification. We also improve the outlier detection power by adding a one-step reweighting procedure. Lastly, we investigate the performance of the proposed methods using simulations and a real-data analysis.\",\"PeriodicalId\":49478,\"journal\":{\"name\":\"Statistica Sinica\",\"volume\":\"1 1\",\"pages\":\"\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistica Sinica\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.5705/ss.202022.0142\",\"RegionNum\":3,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistica Sinica","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.5705/ss.202022.0142","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
Outlier Detection via a Minimum Ridge Covariance Determinant Estimator
: In this paper, we propose an outlier detection procedure, based on a high-breakdown minimum ridge covariance determinant estimator that is especially useful for the large p/n scenario. The estimator is obtained from the subset of observations, after excluding potential outliers, by applying the so-called concentration steps. We explore the asymptotic distribution of the modified Mahalanobis distance related to the proposed estimator under certain moment conditions, and obtain a theoretical cutoff value for outlier identification. We also improve the outlier detection power by adding a one-step reweighting procedure. Lastly, we investigate the performance of the proposed methods using simulations and a real-data analysis.
期刊介绍:
Statistica Sinica aims to meet the needs of statisticians in a rapidly changing world. It provides a forum for the publication of innovative work of high quality in all areas of statistics, including theory, methodology and applications. The journal encourages the development and principled use of statistical methodology that is relevant for society, science and technology.