基于知识库的网络搜索中人的消歧

2007 IEEE International Conference on Research, Innovation and Vision for the Future Pub Date : 2007-03-05 DOI:10.1109/RIVF.2007.369155

Q. Vu, Tomonari Masada, A. Takasu, J. Adachi

{"title":"基于知识库的网络搜索中人的消歧","authors":"Q. Vu, Tomonari Masada, A. Takasu, J. Adachi","doi":"10.1109/RIVF.2007.369155","DOIUrl":null,"url":null,"abstract":"Results of queries by personal names often contain documents related to several people because of the namesake problem. In order to differentiate documents related to different people, an effective method is needed to measure document similarities and to find documents related to the same person. Some previous researchers have used the vector space model or have tried to extract common named entities for measuring similarities. We propose a new method that uses Web directories as a knowledge base to find shared contexts in document pairs and uses the measurement of shared contexts to determine similarities between document pairs. Experimental results show that our proposed method outperforms the vector space model method and the named entity recognition method.","PeriodicalId":158887,"journal":{"name":"2007 IEEE International Conference on Research, Innovation and Vision for the Future","volume":"7 4-6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Disambiguation of People in Web Search Using a Knowledge Base\",\"authors\":\"Q. Vu, Tomonari Masada, A. Takasu, J. Adachi\",\"doi\":\"10.1109/RIVF.2007.369155\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Results of queries by personal names often contain documents related to several people because of the namesake problem. In order to differentiate documents related to different people, an effective method is needed to measure document similarities and to find documents related to the same person. Some previous researchers have used the vector space model or have tried to extract common named entities for measuring similarities. We propose a new method that uses Web directories as a knowledge base to find shared contexts in document pairs and uses the measurement of shared contexts to determine similarities between document pairs. Experimental results show that our proposed method outperforms the vector space model method and the named entity recognition method.\",\"PeriodicalId\":158887,\"journal\":{\"name\":\"2007 IEEE International Conference on Research, Innovation and Vision for the Future\",\"volume\":\"7 4-6\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-03-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2007 IEEE International Conference on Research, Innovation and Vision for the Future\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RIVF.2007.369155\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2007 IEEE International Conference on Research, Innovation and Vision for the Future","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RIVF.2007.369155","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

摘要

由于同名问题，按个人姓名查询的结果通常包含与几个人相关的文档。为了区分涉及不同人的文档，需要一种有效的方法来度量文档的相似度，并找到涉及同一人的文档。以前的一些研究人员已经使用向量空间模型或试图提取共同命名实体来测量相似性。本文提出了一种以Web目录为知识库来查找文档对中的共享上下文，并利用共享上下文度量来确定文档对之间的相似度的方法。实验结果表明，该方法优于向量空间模型方法和命名实体识别方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Disambiguation of People in Web Search Using a Knowledge Base

Results of queries by personal names often contain documents related to several people because of the namesake problem. In order to differentiate documents related to different people, an effective method is needed to measure document similarities and to find documents related to the same person. Some previous researchers have used the vector space model or have tried to extract common named entities for measuring similarities. We propose a new method that uses Web directories as a knowledge base to find shared contexts in document pairs and uses the measurement of shared contexts to determine similarities between document pairs. Experimental results show that our proposed method outperforms the vector space model method and the named entity recognition method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2007 IEEE International Conference on Research, Innovation and Vision for the Future

自引率

0.00%

发文量