{"title":"Textual Document Clustering in Traditional and Modern Approaches (Review)","authors":"W. Yafooz, Z. Bakar, A. Mithun","doi":"10.1109/SPC.2018.8704130","DOIUrl":null,"url":null,"abstract":"An enormous quantity of textual documents is created from the advanced technological use concerning describing, intelligence, interconnection, and thousands of distinct authorizations and was expanding each moment of quotidian circumstances. The Clustering is an automated established process to organize the database on features. Outwardly implementing a clustering technique to textual data, a huge quantity of unstructured data is losing the capability of sharing knowledge. There are many tools and techniques proposed. This paper present and categorized the textual document clustering algorithms (approaches) into two types are classical and modern approaches. Both approaches are implemented to those textual data to obtain and consolidate knowledge from discharged to an extraordinary impression of a prepared document. The two important factors in clustering process are speed of clustering process and accuracy or purity of data clusters. This review paper can be benefits to many researchers who concern on textual document clustering, text mining and data scientist.","PeriodicalId":432464,"journal":{"name":"2018 IEEE Conference on Systems, Process and Control (ICSPC)","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE Conference on Systems, Process and Control (ICSPC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPC.2018.8704130","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
An enormous quantity of textual documents is created from the advanced technological use concerning describing, intelligence, interconnection, and thousands of distinct authorizations and was expanding each moment of quotidian circumstances. The Clustering is an automated established process to organize the database on features. Outwardly implementing a clustering technique to textual data, a huge quantity of unstructured data is losing the capability of sharing knowledge. There are many tools and techniques proposed. This paper present and categorized the textual document clustering algorithms (approaches) into two types are classical and modern approaches. Both approaches are implemented to those textual data to obtain and consolidate knowledge from discharged to an extraordinary impression of a prepared document. The two important factors in clustering process are speed of clustering process and accuracy or purity of data clusters. This review paper can be benefits to many researchers who concern on textual document clustering, text mining and data scientist.