使用平分k均值的数据聚类

2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS) Pub Date : 2019-10-01 DOI:10.1109/ICCCIS48478.2019.8974537

V. Rohilla, Ms Sanika Singh kumar, Sudeshna Chakraborty, Ms. Sanika Singh

{"title":"使用平分k均值的数据聚类","authors":"V. Rohilla, Ms Sanika Singh kumar, Sudeshna Chakraborty, Ms. Sanika Singh","doi":"10.1109/ICCCIS48478.2019.8974537","DOIUrl":null,"url":null,"abstract":"Clustering is one the one of the most important technique of data mining. It is used in many applications like fraud detection, image processing, bioinformatics etc. It has been used in various domains. Many types of the clustering techniques are the following like hierarchical, partitional, spectral clustering, density clustering, grid clustering, model based clustering etc. Bisecting K-Means comes under partitional clustering. It gives better performane, when huge data is used. There are many approached that are developed in the similar domain.One of the technique is Text Mining through which useful information is extracted through text. One of the important concept is statistical pattern mining through which important information is extracted by planning different trends and patterns. Input text patterns are structured that are derived from structured data and corresponding output is generated. The steps of text mining are categories of text, clustering text, extraction, summarization of text, E-R modeling. The various steps of text analysis are retrieval of information, lex. analysis for distribution of word freq. distribution study, recognition of pattern,tagging, extraction of information, techniques of data mining and also link analysis, association, visual. and predictive analyt. In the given paper bisect. K Means algorithm is presented which has the features of k-Means and hierar. clustering.","PeriodicalId":436154,"journal":{"name":"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Data Clustering using Bisecting K-Means\",\"authors\":\"V. Rohilla, Ms Sanika Singh kumar, Sudeshna Chakraborty, Ms. Sanika Singh\",\"doi\":\"10.1109/ICCCIS48478.2019.8974537\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering is one the one of the most important technique of data mining. It is used in many applications like fraud detection, image processing, bioinformatics etc. It has been used in various domains. Many types of the clustering techniques are the following like hierarchical, partitional, spectral clustering, density clustering, grid clustering, model based clustering etc. Bisecting K-Means comes under partitional clustering. It gives better performane, when huge data is used. There are many approached that are developed in the similar domain.One of the technique is Text Mining through which useful information is extracted through text. One of the important concept is statistical pattern mining through which important information is extracted by planning different trends and patterns. Input text patterns are structured that are derived from structured data and corresponding output is generated. The steps of text mining are categories of text, clustering text, extraction, summarization of text, E-R modeling. The various steps of text analysis are retrieval of information, lex. analysis for distribution of word freq. distribution study, recognition of pattern,tagging, extraction of information, techniques of data mining and also link analysis, association, visual. and predictive analyt. In the given paper bisect. K Means algorithm is presented which has the features of k-Means and hierar. clustering.\",\"PeriodicalId\":436154,\"journal\":{\"name\":\"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)\",\"volume\":\"46 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCIS48478.2019.8974537\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCIS48478.2019.8974537","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

聚类是数据挖掘中最重要的技术之一。它被广泛应用于欺诈检测、图像处理、生物信息学等领域。它已被用于各个领域。聚类技术有:分层聚类、分区聚类、谱聚类、密度聚类、网格聚类、基于模型的聚类等。分割K-Means属于分区聚类。当使用大量数据时，它提供了更好的性能。在类似的领域中有许多方法被开发出来。其中一种技术是文本挖掘，通过文本提取有用的信息。其中一个重要的概念是统计模式挖掘，通过规划不同的趋势和模式提取重要的信息。输入文本模式是结构化的，它来源于结构化数据，并生成相应的输出。文本挖掘的步骤包括:文本分类、文本聚类、文本抽取、文本摘要、E-R建模。文本分析的各个步骤是:信息检索、文本分析和文本分析。词频分布分析、分布研究、模式识别、标注、信息提取、数据挖掘技术以及链接分析、关联、可视化。预测分析师。在给定的纸上等分。提出了K均值算法，该算法具有K均值和层次性的特点。集群。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Data Clustering using Bisecting K-Means

Clustering is one the one of the most important technique of data mining. It is used in many applications like fraud detection, image processing, bioinformatics etc. It has been used in various domains. Many types of the clustering techniques are the following like hierarchical, partitional, spectral clustering, density clustering, grid clustering, model based clustering etc. Bisecting K-Means comes under partitional clustering. It gives better performane, when huge data is used. There are many approached that are developed in the similar domain.One of the technique is Text Mining through which useful information is extracted through text. One of the important concept is statistical pattern mining through which important information is extracted by planning different trends and patterns. Input text patterns are structured that are derived from structured data and corresponding output is generated. The steps of text mining are categories of text, clustering text, extraction, summarization of text, E-R modeling. The various steps of text analysis are retrieval of information, lex. analysis for distribution of word freq. distribution study, recognition of pattern,tagging, extraction of information, techniques of data mining and also link analysis, association, visual. and predictive analyt. In the given paper bisect. K Means algorithm is presented which has the features of k-Means and hierar. clustering.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2019 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)

自引率

0.00%

发文量

期刊最新文献

Survey on Stress Emotion Recognition in Speech Weak Form Efficiency Of Currency Futures: Evidence From India YouTube Video Classification based on Title and Description Text SegNet-based Corpus Callosum segmentation for brain Magnetic Resonance Images (MRI) A synchronizer-mediator for lazy replicated databases over a decentralized P2P architecture