Gulmira Tolegen, Alymzhan Toleu, R. Mussabayev, Alexander Krassovitskiy
{"title":"A Clustering-based Approach for Topic Modeling via Word Network Analysis","authors":"Gulmira Tolegen, Alymzhan Toleu, R. Mussabayev, Alexander Krassovitskiy","doi":"10.1109/UBMK55850.2022.9919530","DOIUrl":null,"url":null,"abstract":"This paper presents a clustering-based approach to topic modeling via analyzing word networks based on the adaptation of a community detection algorithm. Word networks are constructed with different word representations, and two types of topic assignments are introduced. Topic coherence score and the document clustering results are reported for topic model evaluation. Experimental results showed that it achieved comparable results with the current best. It also showed that the proposed approach produced a higher performance as the number of most relevant words gets larger in $C_{cv}$ coherence score.","PeriodicalId":417604,"journal":{"name":"2022 7th International Conference on Computer Science and Engineering (UBMK)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 7th International Conference on Computer Science and Engineering (UBMK)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UBMK55850.2022.9919530","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper presents a clustering-based approach to topic modeling via analyzing word networks based on the adaptation of a community detection algorithm. Word networks are constructed with different word representations, and two types of topic assignments are introduced. Topic coherence score and the document clustering results are reported for topic model evaluation. Experimental results showed that it achieved comparable results with the current best. It also showed that the proposed approach produced a higher performance as the number of most relevant words gets larger in $C_{cv}$ coherence score.