Entity Resolution with Markov Logic

Parag Singla, Pedro M. Domingos
{"title":"Entity Resolution with Markov Logic","authors":"Parag Singla, Pedro M. Domingos","doi":"10.1109/ICDM.2006.65","DOIUrl":null,"url":null,"abstract":"Entity resolution is the problem of determining which records in a database refer to the same entities, and is a crucial and expensive step in the data mining process. Interest in it has grown rapidly, and many approaches have been proposed. However, they tend to address only isolated aspects of the problem, and are often ad hoc. This paper proposes a well-founded, integrated solution to the entity resolution problem based on Markov logic. Markov logic combines first-order logic and probabilistic graphical models by attaching weights to first-order formulas, and viewing them as templates for features of Markov networks. We show how a number of previous approaches can be formulated and seamlessly combined in Markov logic, and how the resulting learning and inference problems can be solved efficiently. Experiments on two citation databases show the utility of this approach, and evaluate the contribution of the different components.","PeriodicalId":356443,"journal":{"name":"Sixth International Conference on Data Mining (ICDM'06)","volume":"34 6","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"428","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sixth International Conference on Data Mining (ICDM'06)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDM.2006.65","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 428

Abstract

Entity resolution is the problem of determining which records in a database refer to the same entities, and is a crucial and expensive step in the data mining process. Interest in it has grown rapidly, and many approaches have been proposed. However, they tend to address only isolated aspects of the problem, and are often ad hoc. This paper proposes a well-founded, integrated solution to the entity resolution problem based on Markov logic. Markov logic combines first-order logic and probabilistic graphical models by attaching weights to first-order formulas, and viewing them as templates for features of Markov networks. We show how a number of previous approaches can be formulated and seamlessly combined in Markov logic, and how the resulting learning and inference problems can be solved efficiently. Experiments on two citation databases show the utility of this approach, and evaluate the contribution of the different components.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
马尔可夫逻辑的实体解析
实体解析是确定数据库中哪些记录引用相同实体的问题,是数据挖掘过程中至关重要且代价高昂的一步。人们对它的兴趣迅速增长,并提出了许多方法。然而,它们往往只处理问题的孤立方面,而且往往是特别的。本文提出了一种基于马尔可夫逻辑的实体解析问题的综合解决方案。马尔可夫逻辑结合了一阶逻辑和概率图形模型,将权重附加到一阶公式中,并将其视为马尔可夫网络特征的模板。我们展示了如何在马尔可夫逻辑中制定和无缝结合许多先前的方法,以及如何有效地解决由此产生的学习和推理问题。在两个引文数据库上的实验表明了该方法的有效性,并评估了不同成分的贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Improving Nearest Neighbor Classifier Using Tabu Search and Ensemble Distance Metrics Frequent Closed Itemset Mining Using Prefix Graphs with an Efficient Flow-Based Pruning Strategy Semantic Kernels for Text Classification Based on Topological Measures of Feature Similarity High-Performance Unsupervised Relation Extraction from Large Corpora Multi-Tier Granule Mining for Representations of Multidimensional Association Rules
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1