{"title":"基于未知和非齐次方差下的非参数极大似然估计的高维分类","authors":"Hoyoung Park, Seungchul Baek, Junyong Park","doi":"10.1002/sam.11554","DOIUrl":null,"url":null,"abstract":"We propose a new method in high‐dimensional classification based on estimation of high‐dimensional mean vector under unknown and unequal variances. Our proposed method is based on a semi‐parametric model that combines nonparametric and parametric models for mean and variance, respectively. Our proposed method is designed to be robust to the structure of the mean vector, while most existing methods are developed for some specific cases such as either sparse or non‐sparse case of the mean vector. In addition, we also consider estimating mean and variance separately under nonparametric empirical Bayes framework that has advantage over existing nonparametric empirical Bayes classifiers based on standardization. We present simulation studies showing that our proposed method outperforms a variety of existing methods. Application to real data sets demonstrates robustness of our method to various types of data sets, while all other methods produce either sensitive or poor results for different data sets.","PeriodicalId":342679,"journal":{"name":"Statistical Analysis and Data Mining: The ASA Data Science Journal","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"High‐dimensional classification based on nonparametric maximum likelihood estimation under unknown and inhomogeneous variances\",\"authors\":\"Hoyoung Park, Seungchul Baek, Junyong Park\",\"doi\":\"10.1002/sam.11554\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a new method in high‐dimensional classification based on estimation of high‐dimensional mean vector under unknown and unequal variances. Our proposed method is based on a semi‐parametric model that combines nonparametric and parametric models for mean and variance, respectively. Our proposed method is designed to be robust to the structure of the mean vector, while most existing methods are developed for some specific cases such as either sparse or non‐sparse case of the mean vector. In addition, we also consider estimating mean and variance separately under nonparametric empirical Bayes framework that has advantage over existing nonparametric empirical Bayes classifiers based on standardization. We present simulation studies showing that our proposed method outperforms a variety of existing methods. Application to real data sets demonstrates robustness of our method to various types of data sets, while all other methods produce either sensitive or poor results for different data sets.\",\"PeriodicalId\":342679,\"journal\":{\"name\":\"Statistical Analysis and Data Mining: The ASA Data Science Journal\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-10-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistical Analysis and Data Mining: The ASA Data Science Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/sam.11554\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical Analysis and Data Mining: The ASA Data Science Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/sam.11554","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
High‐dimensional classification based on nonparametric maximum likelihood estimation under unknown and inhomogeneous variances
We propose a new method in high‐dimensional classification based on estimation of high‐dimensional mean vector under unknown and unequal variances. Our proposed method is based on a semi‐parametric model that combines nonparametric and parametric models for mean and variance, respectively. Our proposed method is designed to be robust to the structure of the mean vector, while most existing methods are developed for some specific cases such as either sparse or non‐sparse case of the mean vector. In addition, we also consider estimating mean and variance separately under nonparametric empirical Bayes framework that has advantage over existing nonparametric empirical Bayes classifiers based on standardization. We present simulation studies showing that our proposed method outperforms a variety of existing methods. Application to real data sets demonstrates robustness of our method to various types of data sets, while all other methods produce either sensitive or poor results for different data sets.