{"title":"Cluster of tweet users based on optimal set","authors":"Amit Paul, Animesh Dutta, Frans Coenen","doi":"10.1109/TENCON.2016.7848008","DOIUrl":null,"url":null,"abstract":"Over the years or even decades, researchers are dealing with the problem of duplicate clusters or overlapping clusters in a cluster set. Clusters overlap within each other just as in the case of social networking groups, or grouping movies by genre. In this paper, hierarchical form of clustering is used to cluster user based on interaction which creates numerous clusters with different sizes at different hierarchical level. In doing so, many overlapping clusters are generated but duplicates are not removed. Duplicity possesses a challenge for differentiation. Our work here is two fold. Firstly, to cluster users with different hierarchical levels to generate sets of clusters by level and secondly, to find among the different cluster sets the optimal one by simply using mean and standard deviation. The sense of optimality is different for different requirements. Our work shows that we can have a choice of picking the optimal set by requirement.","PeriodicalId":246458,"journal":{"name":"2016 IEEE Region 10 Conference (TENCON)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE Region 10 Conference (TENCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON.2016.7848008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Over the years or even decades, researchers are dealing with the problem of duplicate clusters or overlapping clusters in a cluster set. Clusters overlap within each other just as in the case of social networking groups, or grouping movies by genre. In this paper, hierarchical form of clustering is used to cluster user based on interaction which creates numerous clusters with different sizes at different hierarchical level. In doing so, many overlapping clusters are generated but duplicates are not removed. Duplicity possesses a challenge for differentiation. Our work here is two fold. Firstly, to cluster users with different hierarchical levels to generate sets of clusters by level and secondly, to find among the different cluster sets the optimal one by simply using mean and standard deviation. The sense of optimality is different for different requirements. Our work shows that we can have a choice of picking the optimal set by requirement.