{"title":"用模糊c均值聚类改进Parzen密度估计的泛化","authors":"Jing Zhou, Yushi Yang, Yajing Zhang","doi":"10.1109/ICSESS.2012.6269406","DOIUrl":null,"url":null,"abstract":"Using fuzzy c-means clustering procedure to find a condensed set for Parzen windows estimation (ParzenFCMC) is proposed in this paper. The full Parzen windows estimator usually requires more computation and storage. However, the experimental simulations show that the significant increase of reference data may not improve the estimation performance of Parzen windows method obviously. In addition, the theoretical analysis validates the traditional Parzen windows estimator is sensitive to noise data. Thus, in order to improve the generalization capability (i.e., the adaptability to nosie data) of Parzen windows estimation, we try to find a condensed dataset to conduct the probability density estimation by adopting the following measures: 1) clustering the original dataset by using fuzzy c-means; 2) estimating the underlying density function based on the condensed reference set. Finally, the experimental results on the synthetic datasets obeying Uniform, Normal, Exponential, and Rayleigh distributions show the usefulness and effectiveness of proposed ParzenFCMC. The significant savings on computation and storage can be achieved with only minimal mean integrated squared error (MISE) degradation.","PeriodicalId":205738,"journal":{"name":"2012 IEEE International Conference on Computer Science and Automation Engineering","volume":"16 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improving generalization of Parzen density estimation by fuzzy c-means clustering\",\"authors\":\"Jing Zhou, Yushi Yang, Yajing Zhang\",\"doi\":\"10.1109/ICSESS.2012.6269406\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Using fuzzy c-means clustering procedure to find a condensed set for Parzen windows estimation (ParzenFCMC) is proposed in this paper. The full Parzen windows estimator usually requires more computation and storage. However, the experimental simulations show that the significant increase of reference data may not improve the estimation performance of Parzen windows method obviously. In addition, the theoretical analysis validates the traditional Parzen windows estimator is sensitive to noise data. Thus, in order to improve the generalization capability (i.e., the adaptability to nosie data) of Parzen windows estimation, we try to find a condensed dataset to conduct the probability density estimation by adopting the following measures: 1) clustering the original dataset by using fuzzy c-means; 2) estimating the underlying density function based on the condensed reference set. Finally, the experimental results on the synthetic datasets obeying Uniform, Normal, Exponential, and Rayleigh distributions show the usefulness and effectiveness of proposed ParzenFCMC. The significant savings on computation and storage can be achieved with only minimal mean integrated squared error (MISE) degradation.\",\"PeriodicalId\":205738,\"journal\":{\"name\":\"2012 IEEE International Conference on Computer Science and Automation Engineering\",\"volume\":\"16 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-06-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Conference on Computer Science and Automation Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSESS.2012.6269406\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Computer Science and Automation Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS.2012.6269406","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving generalization of Parzen density estimation by fuzzy c-means clustering
Using fuzzy c-means clustering procedure to find a condensed set for Parzen windows estimation (ParzenFCMC) is proposed in this paper. The full Parzen windows estimator usually requires more computation and storage. However, the experimental simulations show that the significant increase of reference data may not improve the estimation performance of Parzen windows method obviously. In addition, the theoretical analysis validates the traditional Parzen windows estimator is sensitive to noise data. Thus, in order to improve the generalization capability (i.e., the adaptability to nosie data) of Parzen windows estimation, we try to find a condensed dataset to conduct the probability density estimation by adopting the following measures: 1) clustering the original dataset by using fuzzy c-means; 2) estimating the underlying density function based on the condensed reference set. Finally, the experimental results on the synthetic datasets obeying Uniform, Normal, Exponential, and Rayleigh distributions show the usefulness and effectiveness of proposed ParzenFCMC. The significant savings on computation and storage can be achieved with only minimal mean integrated squared error (MISE) degradation.