{"title":"基于密度分析的k -介质方法搜索结果聚类","authors":"Hungming Hung, J. Watada","doi":"10.1109/IIAI-AAI.2014.41","DOIUrl":null,"url":null,"abstract":"After obtaining search results through web search engine, classifying into clusters enables us to quickly browse them. Currently, famous search engines like Google, Bing and Baidu always return a long list of web pages which can be more than a hundred million that are ranked by their relevancies to the search key words. Users are forced to examine the results to look for their required information. This consumes a lot of time when the results come into so huge a number that consisting various kinds. Traditional clustering techniques are inadequate for readable descriptions. In this research, we first build a local semantic thesaurus (L.S.T) to transform natural language into two dimensional numerical points. Second, we analyze and gather different attributes of the search results so as to cluster them through on density analysis based K-Medoids method. Without defining categories in advance, K-Medoids method generates clusters with less susceptibility to noise. Experimental results verify our method's feasibility and effectiveness.","PeriodicalId":432222,"journal":{"name":"2014 IIAI 3rd International Conference on Advanced Applied Informatics","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Search Result Clustering through Density Analysis Based K-Medoids Method\",\"authors\":\"Hungming Hung, J. Watada\",\"doi\":\"10.1109/IIAI-AAI.2014.41\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"After obtaining search results through web search engine, classifying into clusters enables us to quickly browse them. Currently, famous search engines like Google, Bing and Baidu always return a long list of web pages which can be more than a hundred million that are ranked by their relevancies to the search key words. Users are forced to examine the results to look for their required information. This consumes a lot of time when the results come into so huge a number that consisting various kinds. Traditional clustering techniques are inadequate for readable descriptions. In this research, we first build a local semantic thesaurus (L.S.T) to transform natural language into two dimensional numerical points. Second, we analyze and gather different attributes of the search results so as to cluster them through on density analysis based K-Medoids method. Without defining categories in advance, K-Medoids method generates clusters with less susceptibility to noise. Experimental results verify our method's feasibility and effectiveness.\",\"PeriodicalId\":432222,\"journal\":{\"name\":\"2014 IIAI 3rd International Conference on Advanced Applied Informatics\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 IIAI 3rd International Conference on Advanced Applied Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IIAI-AAI.2014.41\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IIAI 3rd International Conference on Advanced Applied Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIAI-AAI.2014.41","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Search Result Clustering through Density Analysis Based K-Medoids Method
After obtaining search results through web search engine, classifying into clusters enables us to quickly browse them. Currently, famous search engines like Google, Bing and Baidu always return a long list of web pages which can be more than a hundred million that are ranked by their relevancies to the search key words. Users are forced to examine the results to look for their required information. This consumes a lot of time when the results come into so huge a number that consisting various kinds. Traditional clustering techniques are inadequate for readable descriptions. In this research, we first build a local semantic thesaurus (L.S.T) to transform natural language into two dimensional numerical points. Second, we analyze and gather different attributes of the search results so as to cluster them through on density analysis based K-Medoids method. Without defining categories in advance, K-Medoids method generates clusters with less susceptibility to noise. Experimental results verify our method's feasibility and effectiveness.