Haojian Zhong, Lida Xu, Cheng Xie, Boyi Xu, Fenglin Bu, Hongming Cai
{"title":"A Similarity Graph Matching Approach for Instance Disambiguation","authors":"Haojian Zhong, Lida Xu, Cheng Xie, Boyi Xu, Fenglin Bu, Hongming Cai","doi":"10.1109/ES.2016.9","DOIUrl":null,"url":null,"abstract":"Instance matching acts as a significant part of information integration in semantic web research. While ontology matching focuses on the schema level of data, instance matching deals with massive instances objects. Ambiguation is a common problem which may lead to error matching when different instances share the same names or descriptions. To cope with this problem structural approach is used by many matching systems for disambiguation. However, existing structural approach has a hidden problem named 'error propagation' which would affect the precision of matching result. In this paper, we investigate instance matching techniques and propose a new instance matching framework. It is based on a novel structural matching algorithm which calculates similarity separately on sub graphs. The structural information is fully taken advantage of to realize disambiguation and several indexing strategies are used to cut down the computing overhead. We have conducted experiments on instance matching benchmark and results show that our proposed matching approach is comparable to state-of-art systems. And experiment on real dataset has proved the validity of our approach in instance disambiguation.","PeriodicalId":184435,"journal":{"name":"2016 4th International Conference on Enterprise Systems (ES)","volume":"104 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 4th International Conference on Enterprise Systems (ES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ES.2016.9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Instance matching acts as a significant part of information integration in semantic web research. While ontology matching focuses on the schema level of data, instance matching deals with massive instances objects. Ambiguation is a common problem which may lead to error matching when different instances share the same names or descriptions. To cope with this problem structural approach is used by many matching systems for disambiguation. However, existing structural approach has a hidden problem named 'error propagation' which would affect the precision of matching result. In this paper, we investigate instance matching techniques and propose a new instance matching framework. It is based on a novel structural matching algorithm which calculates similarity separately on sub graphs. The structural information is fully taken advantage of to realize disambiguation and several indexing strategies are used to cut down the computing overhead. We have conducted experiments on instance matching benchmark and results show that our proposed matching approach is comparable to state-of-art systems. And experiment on real dataset has proved the validity of our approach in instance disambiguation.