使用平分k均值的数据聚类

V. Rohilla, Ms Sanika Singh kumar, Sudeshna Chakraborty, Ms. Sanika Singh
{"title":"使用平分k均值的数据聚类","authors":"V. Rohilla, Ms Sanika Singh kumar, Sudeshna Chakraborty, Ms. Sanika Singh","doi":"10.1109/ICCCIS48478.2019.8974537","DOIUrl":null,"url":null,"abstract":"Clustering is one the one of the most important technique of data mining. It is used in many applications like fraud detection, image processing, bioinformatics etc. It has been used in various domains. Many types of the clustering techniques are the following like hierarchical, partitional, spectral clustering, density clustering, grid clustering, model based clustering etc. Bisecting K-Means comes under partitional clustering. It gives better performane, when huge data is used. There are many approached that are developed in the similar domain.One of the technique is Text Mining through which useful information is extracted through text. One of the important concept is statistical pattern mining through which important information is extracted by planning different trends and patterns. Input text patterns are structured that are derived from structured data and corresponding output is generated. The steps of text mining are categories of text, clustering text, extraction, summarization of text, E-R modeling. The various steps of text analysis are retrieval of information, lex. analysis for distribution of word freq. distribution study, recognition of pattern,tagging, extraction of information, techniques of data mining and also link analysis, association, visual. and predictive analyt. In the given paper bisect. K Means algorithm is presented which has the features of k-Means and hierar. clustering.","PeriodicalId":436154,"journal":{"name":"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Data Clustering using Bisecting K-Means\",\"authors\":\"V. Rohilla, Ms Sanika Singh kumar, Sudeshna Chakraborty, Ms. Sanika Singh\",\"doi\":\"10.1109/ICCCIS48478.2019.8974537\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering is one the one of the most important technique of data mining. It is used in many applications like fraud detection, image processing, bioinformatics etc. It has been used in various domains. Many types of the clustering techniques are the following like hierarchical, partitional, spectral clustering, density clustering, grid clustering, model based clustering etc. Bisecting K-Means comes under partitional clustering. It gives better performane, when huge data is used. There are many approached that are developed in the similar domain.One of the technique is Text Mining through which useful information is extracted through text. One of the important concept is statistical pattern mining through which important information is extracted by planning different trends and patterns. Input text patterns are structured that are derived from structured data and corresponding output is generated. The steps of text mining are categories of text, clustering text, extraction, summarization of text, E-R modeling. The various steps of text analysis are retrieval of information, lex. analysis for distribution of word freq. distribution study, recognition of pattern,tagging, extraction of information, techniques of data mining and also link analysis, association, visual. and predictive analyt. In the given paper bisect. K Means algorithm is presented which has the features of k-Means and hierar. clustering.\",\"PeriodicalId\":436154,\"journal\":{\"name\":\"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCIS48478.2019.8974537\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCIS48478.2019.8974537","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

聚类是数据挖掘中最重要的技术之一。它被广泛应用于欺诈检测、图像处理、生物信息学等领域。它已被用于各个领域。聚类技术有:分层聚类、分区聚类、谱聚类、密度聚类、网格聚类、基于模型的聚类等。分割K-Means属于分区聚类。当使用大量数据时,它提供了更好的性能。在类似的领域中有许多方法被开发出来。其中一种技术是文本挖掘,通过文本提取有用的信息。其中一个重要的概念是统计模式挖掘,通过规划不同的趋势和模式提取重要的信息。输入文本模式是结构化的,它来源于结构化数据,并生成相应的输出。文本挖掘的步骤包括:文本分类、文本聚类、文本抽取、文本摘要、E-R建模。文本分析的各个步骤是:信息检索、文本分析和文本分析。词频分布分析、分布研究、模式识别、标注、信息提取、数据挖掘技术以及链接分析、关联、可视化。预测分析师。在给定的纸上等分。提出了K均值算法,该算法具有K均值和层次性的特点。集群。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Data Clustering using Bisecting K-Means
Clustering is one the one of the most important technique of data mining. It is used in many applications like fraud detection, image processing, bioinformatics etc. It has been used in various domains. Many types of the clustering techniques are the following like hierarchical, partitional, spectral clustering, density clustering, grid clustering, model based clustering etc. Bisecting K-Means comes under partitional clustering. It gives better performane, when huge data is used. There are many approached that are developed in the similar domain.One of the technique is Text Mining through which useful information is extracted through text. One of the important concept is statistical pattern mining through which important information is extracted by planning different trends and patterns. Input text patterns are structured that are derived from structured data and corresponding output is generated. The steps of text mining are categories of text, clustering text, extraction, summarization of text, E-R modeling. The various steps of text analysis are retrieval of information, lex. analysis for distribution of word freq. distribution study, recognition of pattern,tagging, extraction of information, techniques of data mining and also link analysis, association, visual. and predictive analyt. In the given paper bisect. K Means algorithm is presented which has the features of k-Means and hierar. clustering.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Survey on Stress Emotion Recognition in Speech Weak Form Efficiency Of Currency Futures: Evidence From India YouTube Video Classification based on Title and Description Text SegNet-based Corpus Callosum segmentation for brain Magnetic Resonance Images (MRI) A synchronizer-mediator for lazy replicated databases over a decentralized P2P architecture
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1