{"title":"GCMDDBSCAN:基于网格和贡献的多密度DBSCAN","authors":"Linmeng Zhang, Zhigao Xu, Fengqi Si","doi":"10.1109/DASC.2013.115","DOIUrl":null,"url":null,"abstract":"Multi Density DBSCAN (Density Based Spatial Clustering of Application with Noise) is an excellent density-based clustering algorithm, which extends DBSCAN algorithm so as to be able to discover the different densities clusters, and retains the advantage of separating noise and finding arbitrary shape clusters. But, because of great memory demand and low calculation efficiency, Multi Density DBSCAN can't deal with large database. Therefore, GCMDDBSCAN is proposed in this paper, and within it 'migration-coefficient' conception is introduced firstly. In GCMDDBSCAN, with the grid technique, the optimization effect of contribution and migration-coefficient, and the efficient SP-tree query index, the runtime is reduced a lot, and the capability of clustering large database is obviously enhanced, at the same time, the accuracy of clustering result is not degraded.","PeriodicalId":179557,"journal":{"name":"2013 IEEE 11th International Conference on Dependable, Autonomic and Secure Computing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"GCMDDBSCAN: Multi-density DBSCAN Based on Grid and Contribution\",\"authors\":\"Linmeng Zhang, Zhigao Xu, Fengqi Si\",\"doi\":\"10.1109/DASC.2013.115\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Multi Density DBSCAN (Density Based Spatial Clustering of Application with Noise) is an excellent density-based clustering algorithm, which extends DBSCAN algorithm so as to be able to discover the different densities clusters, and retains the advantage of separating noise and finding arbitrary shape clusters. But, because of great memory demand and low calculation efficiency, Multi Density DBSCAN can't deal with large database. Therefore, GCMDDBSCAN is proposed in this paper, and within it 'migration-coefficient' conception is introduced firstly. In GCMDDBSCAN, with the grid technique, the optimization effect of contribution and migration-coefficient, and the efficient SP-tree query index, the runtime is reduced a lot, and the capability of clustering large database is obviously enhanced, at the same time, the accuracy of clustering result is not degraded.\",\"PeriodicalId\":179557,\"journal\":{\"name\":\"2013 IEEE 11th International Conference on Dependable, Autonomic and Secure Computing\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-12-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 IEEE 11th International Conference on Dependable, Autonomic and Secure Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DASC.2013.115\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE 11th International Conference on Dependable, Autonomic and Secure Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DASC.2013.115","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
摘要
Multi Density DBSCAN (Density Based Spatial Clustering of Application with Noise)是一种优秀的基于密度的聚类算法,它对DBSCAN算法进行了扩展,可以发现不同密度的聚类,同时保留了分离噪声和发现任意形状聚类的优点。但是,由于内存需求大和计算效率低,Multi - Density DBSCAN无法处理大型数据库。因此,本文提出了GCMDDBSCAN,并在其中首先引入了“迁移系数”的概念。在GCMDDBSCAN中,利用网格技术、贡献系数和迁移系数的优化效果以及高效的sp树查询索引,大大减少了运行时间,明显增强了大型数据库的聚类能力,同时不降低聚类结果的准确性。
GCMDDBSCAN: Multi-density DBSCAN Based on Grid and Contribution
Multi Density DBSCAN (Density Based Spatial Clustering of Application with Noise) is an excellent density-based clustering algorithm, which extends DBSCAN algorithm so as to be able to discover the different densities clusters, and retains the advantage of separating noise and finding arbitrary shape clusters. But, because of great memory demand and low calculation efficiency, Multi Density DBSCAN can't deal with large database. Therefore, GCMDDBSCAN is proposed in this paper, and within it 'migration-coefficient' conception is introduced firstly. In GCMDDBSCAN, with the grid technique, the optimization effect of contribution and migration-coefficient, and the efficient SP-tree query index, the runtime is reduced a lot, and the capability of clustering large database is obviously enhanced, at the same time, the accuracy of clustering result is not degraded.