DBSCAN Clustering Algorithm Based on Density

Dingsheng Deng
{"title":"DBSCAN Clustering Algorithm Based on Density","authors":"Dingsheng Deng","doi":"10.1109/IFEEA51475.2020.00199","DOIUrl":null,"url":null,"abstract":"Clustering technology has important applications in data mining, pattern recognition, machine learning and other fields. However, with the explosive growth of data, traditional clustering algorithm is more and more difficult to meet the needs of big data analysis. How to improve the traditional clustering algorithm and ensure the quality and efficiency of clustering under the background of big data has become an important research topic of artificial intelligence and big data processing. The density-based clustering algorithm can cluster arbitrarily shaped data sets in the case of unknown data distribution. DBSCAN is a classical density-based clustering algorithm, which is widely used for data clustering analysis due to its simple and efficient characteristics. The purpose of this paper is to study DBSCAN clustering algorithm based on density. This paper first introduces the concept of DBSCAN algorithm, and then carries out performance tests on DBSCAN algorithm in three different data sets. By analyzing the experimental results, it can be concluded that DBSCAN algorithm has higher homogeneity and diversity when it performs personalized clustering on data sets of non-uniform density with broad values and gradually sparse forwards. When the DBSCAN algorithm's neighborhood distance eps is 1000, 26 classes are generated after clustering.","PeriodicalId":285980,"journal":{"name":"2020 7th International Forum on Electrical Engineering and Automation (IFEEA)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"40","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 7th International Forum on Electrical Engineering and Automation (IFEEA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IFEEA51475.2020.00199","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 40

Abstract

Clustering technology has important applications in data mining, pattern recognition, machine learning and other fields. However, with the explosive growth of data, traditional clustering algorithm is more and more difficult to meet the needs of big data analysis. How to improve the traditional clustering algorithm and ensure the quality and efficiency of clustering under the background of big data has become an important research topic of artificial intelligence and big data processing. The density-based clustering algorithm can cluster arbitrarily shaped data sets in the case of unknown data distribution. DBSCAN is a classical density-based clustering algorithm, which is widely used for data clustering analysis due to its simple and efficient characteristics. The purpose of this paper is to study DBSCAN clustering algorithm based on density. This paper first introduces the concept of DBSCAN algorithm, and then carries out performance tests on DBSCAN algorithm in three different data sets. By analyzing the experimental results, it can be concluded that DBSCAN algorithm has higher homogeneity and diversity when it performs personalized clustering on data sets of non-uniform density with broad values and gradually sparse forwards. When the DBSCAN algorithm's neighborhood distance eps is 1000, 26 classes are generated after clustering.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于密度的DBSCAN聚类算法
聚类技术在数据挖掘、模式识别、机器学习等领域有着重要的应用。然而,随着数据的爆炸式增长,传统的聚类算法越来越难以满足大数据分析的需求。如何在大数据背景下改进传统的聚类算法,保证聚类的质量和效率,已成为人工智能和大数据处理的重要研究课题。基于密度的聚类算法可以在数据分布未知的情况下对任意形状的数据集进行聚类。DBSCAN是一种经典的基于密度的聚类算法,以其简单高效的特点被广泛应用于数据聚类分析。本文的目的是研究基于密度的DBSCAN聚类算法。本文首先介绍了DBSCAN算法的概念,然后在三个不同的数据集上对DBSCAN算法进行了性能测试。通过对实验结果的分析,可以得出DBSCAN算法在对非均匀密度的数据集进行个性化聚类时具有较高的同质性和多样性。当DBSCAN算法的邻域距离eps为1000时,聚类后生成26个类。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
From Majorana fermions to topological quantum computation The effect of duct on the inflow velocity at the propeller plane Multi-market Bidding Model for Generation Enterprises Based on Mean-CVaR Theory Research on reliability Optimization design of multilayer porcelain dielectric capacitance of aerospace electronic products based on dynamic simulation Research on Normalization Construction of Electric Grid
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1