{"title":"基于搜索结果分析的语义相关度计算","authors":"Jiangjiao Duan, Jianping Zeng","doi":"10.1109/WI-IAT.2012.29","DOIUrl":null,"url":null,"abstract":"Automatically computing the semantic relatedness of two words is an essential step for many tasks in natural language processing, including information retrieval. Previous approaches to computing semantic relatedness used statistical techniques or lexical resources. We propose Searcher Result Analysis (SRA), a novel method that captures related text from search engine by issuing proper queries. Inferring the relatedness is then based on word occurrences in certain number of pages. Compared with the previous state of the art, using SRA to computing semantic relatedness based on Wikipedia can achieve competitive results with no need to maintain a local copy of remote resources. It is also shown that the correctness can be further improved by selecting proper knowledge resources or corpora for SRA.","PeriodicalId":220218,"journal":{"name":"2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Computing Semantic Relatedness Based on Search Result Analysis\",\"authors\":\"Jiangjiao Duan, Jianping Zeng\",\"doi\":\"10.1109/WI-IAT.2012.29\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatically computing the semantic relatedness of two words is an essential step for many tasks in natural language processing, including information retrieval. Previous approaches to computing semantic relatedness used statistical techniques or lexical resources. We propose Searcher Result Analysis (SRA), a novel method that captures related text from search engine by issuing proper queries. Inferring the relatedness is then based on word occurrences in certain number of pages. Compared with the previous state of the art, using SRA to computing semantic relatedness based on Wikipedia can achieve competitive results with no need to maintain a local copy of remote resources. It is also shown that the correctness can be further improved by selecting proper knowledge resources or corpora for SRA.\",\"PeriodicalId\":220218,\"journal\":{\"name\":\"2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WI-IAT.2012.29\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI-IAT.2012.29","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Computing Semantic Relatedness Based on Search Result Analysis
Automatically computing the semantic relatedness of two words is an essential step for many tasks in natural language processing, including information retrieval. Previous approaches to computing semantic relatedness used statistical techniques or lexical resources. We propose Searcher Result Analysis (SRA), a novel method that captures related text from search engine by issuing proper queries. Inferring the relatedness is then based on word occurrences in certain number of pages. Compared with the previous state of the art, using SRA to computing semantic relatedness based on Wikipedia can achieve competitive results with no need to maintain a local copy of remote resources. It is also shown that the correctness can be further improved by selecting proper knowledge resources or corpora for SRA.