{"title":"Identifying main topics in density-based spatial clusters using network-based representative document extraction","authors":"Tatsuhiro Sakai, Keiichi Tamura, H. Kitakami","doi":"10.1109/IWCIA.2015.7449466","DOIUrl":null,"url":null,"abstract":"Geo-tagged documents on social media are usually related to local topics and events. Extracting areas of interest associated with local “attractive” topics from geo-tagged documents is one of the most important challenges in many application domains. In this paper, we propose a novel method for extracting the areas of interest from geo-tagged documents. There are two main steps in the proposed method. First, the (ε, σ)-density-based adaptive spatial clustering algorithm extracts areas where local topics are attracting attention as spatial clusters. Second, representative geo-tagged documents are detected to identify the main topic in each spatial cluster. The (ε, σ)-density-based adaptive spatial clustering algorithm changes the threshold for seamlessly extracting spatial clusters regardless of the local densities of the posted geo-tagged documents. Moreover, the proposed method utilizes the network-based important sentence extraction method in order to extract representative geo-tagged documents from each spatial cluster. The experimental results show that the proposed method can extract the areas of interest as spatial clusters and representative documents as main topics.","PeriodicalId":298756,"journal":{"name":"2015 IEEE 8th International Workshop on Computational Intelligence and Applications (IWCIA)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 8th International Workshop on Computational Intelligence and Applications (IWCIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IWCIA.2015.7449466","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Geo-tagged documents on social media are usually related to local topics and events. Extracting areas of interest associated with local “attractive” topics from geo-tagged documents is one of the most important challenges in many application domains. In this paper, we propose a novel method for extracting the areas of interest from geo-tagged documents. There are two main steps in the proposed method. First, the (ε, σ)-density-based adaptive spatial clustering algorithm extracts areas where local topics are attracting attention as spatial clusters. Second, representative geo-tagged documents are detected to identify the main topic in each spatial cluster. The (ε, σ)-density-based adaptive spatial clustering algorithm changes the threshold for seamlessly extracting spatial clusters regardless of the local densities of the posted geo-tagged documents. Moreover, the proposed method utilizes the network-based important sentence extraction method in order to extract representative geo-tagged documents from each spatial cluster. The experimental results show that the proposed method can extract the areas of interest as spatial clusters and representative documents as main topics.