Towards best-effort merge of taxonomically organized data

2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010) Pub Date : 2010-03-01 DOI:10.1109/ICDEW.2010.5452756

D. Thau, S. Bowers, Bertram Ludäscher

引用次数: 5

Abstract

We consider the task of merging datasets that have been organized using different, but aligned taxonomies. We assume such a merge is intended to create a single dataset that unambiguously describes the information in the source datasets using the alignment. We also assume that the merged result should reflect the observations of the datasets as specifically as possible. Typically, there will be no single merge result that is both unambiguous and maximally specific. In this case, a user may be provided with a set of possible merged datasets. If the user requires a single dataset, that dataset loses specificity. Here we examine whether the data exchange setting can provide a way to derive a ¿best-effort¿ merge. We find that the data exchange setting might be a good candidate for providing the merge, but further research is needed.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

实现分类组织数据的最佳合并

我们考虑合并使用不同但一致的分类法组织的数据集的任务。我们假设这样的合并是为了创建一个单一的数据集，该数据集使用对齐来明确地描述源数据集中的信息。我们还假设合并的结果应尽可能具体地反映数据集的观测结果。通常，不会有一个合并结果是明确的和最大程度的特定的。在这种情况下，可能会向用户提供一组可能合并的数据集。如果用户需要单个数据集，则该数据集将失去特异性。在这里，我们检查数据交换设置是否可以提供一种获得“尽力而为”合并的方法。我们发现数据交换设置可能是提供合并的一个很好的候选，但需要进一步的研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010)

自引率

0.00%

发文量

期刊最新文献

Fast algorithms for time series mining Ontology alignment argumentation with mutual dependency between arguments and mappings A first step towards integration independence Towards enterprise software as a service in the cloud U-DBSCAN : A density-based clustering algorithm for uncertain objects