{"title":"基于网格邻域搜索的密度峰空间聚类","authors":"Shaotong Luan, Cong Lu, Liang Bai, Haoran Wang","doi":"10.1109/IICSPI48186.2019.9095889","DOIUrl":null,"url":null,"abstract":"In the application of spatial data clustering, the density-based clustering method can achieve good results. DPC algorithm is a density-based clustering algorithm, which can discover the clustering of irregular shapes. The algorithm is trustworthy of clustering results, simple to implement, and parameter robust. However, the DPC algorithm needs to calculate the distance between the two pairs. It takes a long time to calculate the local density and high-density distance for large-scale spatial data sets. To solve the problem of low efficiency in large datasets, this paper improved the DPC algorithm and proposed a density peak clustering algorithm, DPSCGNS, based on grid neighborhood search. DPSCGNS map raw data to grid cells and redefine the local distance and high-density distance of grid cells. By using the grid to index neighborhood information, the local density and high-density distance of grid cells can be calculated rapidly. Experiments on several data sets demonstrate that the efficiency of DPSCGNS algorithm is improved without decline on clustering effect.","PeriodicalId":318693,"journal":{"name":"2019 2nd International Conference on Safety Produce Informatization (IICSPI)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Density Peaks Spatial Clustering by Grid Neighborhood Search\",\"authors\":\"Shaotong Luan, Cong Lu, Liang Bai, Haoran Wang\",\"doi\":\"10.1109/IICSPI48186.2019.9095889\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the application of spatial data clustering, the density-based clustering method can achieve good results. DPC algorithm is a density-based clustering algorithm, which can discover the clustering of irregular shapes. The algorithm is trustworthy of clustering results, simple to implement, and parameter robust. However, the DPC algorithm needs to calculate the distance between the two pairs. It takes a long time to calculate the local density and high-density distance for large-scale spatial data sets. To solve the problem of low efficiency in large datasets, this paper improved the DPC algorithm and proposed a density peak clustering algorithm, DPSCGNS, based on grid neighborhood search. DPSCGNS map raw data to grid cells and redefine the local distance and high-density distance of grid cells. By using the grid to index neighborhood information, the local density and high-density distance of grid cells can be calculated rapidly. Experiments on several data sets demonstrate that the efficiency of DPSCGNS algorithm is improved without decline on clustering effect.\",\"PeriodicalId\":318693,\"journal\":{\"name\":\"2019 2nd International Conference on Safety Produce Informatization (IICSPI)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 2nd International Conference on Safety Produce Informatization (IICSPI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IICSPI48186.2019.9095889\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 2nd International Conference on Safety Produce Informatization (IICSPI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IICSPI48186.2019.9095889","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Density Peaks Spatial Clustering by Grid Neighborhood Search
In the application of spatial data clustering, the density-based clustering method can achieve good results. DPC algorithm is a density-based clustering algorithm, which can discover the clustering of irregular shapes. The algorithm is trustworthy of clustering results, simple to implement, and parameter robust. However, the DPC algorithm needs to calculate the distance between the two pairs. It takes a long time to calculate the local density and high-density distance for large-scale spatial data sets. To solve the problem of low efficiency in large datasets, this paper improved the DPC algorithm and proposed a density peak clustering algorithm, DPSCGNS, based on grid neighborhood search. DPSCGNS map raw data to grid cells and redefine the local distance and high-density distance of grid cells. By using the grid to index neighborhood information, the local density and high-density distance of grid cells can be calculated rapidly. Experiments on several data sets demonstrate that the efficiency of DPSCGNS algorithm is improved without decline on clustering effect.