Myunggwon Hwang, Hyunjang Kong, Sunkyoung Baek, Pankoo Kim
{"title":"TSM. Topic Selection Method of Web Documents","authors":"Myunggwon Hwang, Hyunjang Kong, Sunkyoung Baek, Pankoo Kim","doi":"10.1109/AMS.2007.108","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a topic selection method about Web documents. The idea of our approach is to utilize an ontology structure and TF (term frequency) values of each term. For improving the performance of documents clustering, our research is strongly demanded. We process Web documents for keywords acquisition using TF values and relevancy values between terms using relations defined in WordNet. And then, we proposed the topic selection formula as we consider three kinds of cases during the topic selection. In conclusion, we demonstrate that our approach is very useful for the topic selection of documents","PeriodicalId":198751,"journal":{"name":"First Asia International Conference on Modelling & Simulation (AMS'07)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"First Asia International Conference on Modelling & Simulation (AMS'07)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AMS.2007.108","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this paper, we propose a topic selection method about Web documents. The idea of our approach is to utilize an ontology structure and TF (term frequency) values of each term. For improving the performance of documents clustering, our research is strongly demanded. We process Web documents for keywords acquisition using TF values and relevancy values between terms using relations defined in WordNet. And then, we proposed the topic selection formula as we consider three kinds of cases during the topic selection. In conclusion, we demonstrate that our approach is very useful for the topic selection of documents