{"title":"A LDA-Based Approach for Interactive Web Mining of Topic Evolutionary Patterns","authors":"Bin Zhou, Jiuming Huang, Kai-Yuan Cui","doi":"10.1109/ITAPP.2010.5566219","DOIUrl":null,"url":null,"abstract":"Many real-world Web mining tasks need to discover topics interactively, which means the users are likely to interfere the topic discovery and selection processes by expressing their preferences. In this paper, a new algorithm based on Latent Dirichlet Allocation (LDA) is proposed for interactive topic evolution pattern detection. To eliminate those topics not interested, it allows the users to add supervised information by adjusting the posterior topic-word distributions, which may influence the inference process of the following iteration. A framework is designed to incorporate different kinds of supervised information. Experiments on English and Chinese corpus show that the extracted topics capture meaningful themes and the suppervised information can help to find better topics more efficiently.","PeriodicalId":116013,"journal":{"name":"2010 International Conference on Internet Technology and Applications","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 International Conference on Internet Technology and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ITAPP.2010.5566219","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Many real-world Web mining tasks need to discover topics interactively, which means the users are likely to interfere the topic discovery and selection processes by expressing their preferences. In this paper, a new algorithm based on Latent Dirichlet Allocation (LDA) is proposed for interactive topic evolution pattern detection. To eliminate those topics not interested, it allows the users to add supervised information by adjusting the posterior topic-word distributions, which may influence the inference process of the following iteration. A framework is designed to incorporate different kinds of supervised information. Experiments on English and Chinese corpus show that the extracted topics capture meaningful themes and the suppervised information can help to find better topics more efficiently.