克服分布式RDFS推理中基于项划分的限制

SWIM '13 Pub Date : 2013-06-23 DOI:10.1145/2484712.2484719

Tugba Kulahcioglu, Hasan Bulut

{"title":"克服分布式RDFS推理中基于项划分的限制","authors":"Tugba Kulahcioglu, Hasan Bulut","doi":"10.1145/2484712.2484719","DOIUrl":null,"url":null,"abstract":"RDFS reasoning is carried out via joint terms of triples; accordingly, a distributed reasoning approach should bring together triples that have terms in common. To achieve this, term-based partitioning distributes triples to partitions based on the terms they include. However, skewed distribution of Semantic Web data results in unbalanced load distribution. A single peer should be able to handle even the largest partition, and this requirement limits scalability. This approach also suffers from data replication since a triple is sent to multiple partitions. In this paper, we propose a two-step method to overcome above limitations. Our RDFS specific term-based partitioning algorithm applies a selective distribution policy and distributes triples with minimum replication. Our schema-sensitive processing approach eliminates non-productive partitions, and enables processing of a partition regardless of its size. Resulting partitions reach full closure without repeating the global schema or without fix-point iteration as suggested by previous studies.","PeriodicalId":420849,"journal":{"name":"SWIM '13","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Overcoming limitations of term-based partitioning for distributed RDFS reasoning\",\"authors\":\"Tugba Kulahcioglu, Hasan Bulut\",\"doi\":\"10.1145/2484712.2484719\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"RDFS reasoning is carried out via joint terms of triples; accordingly, a distributed reasoning approach should bring together triples that have terms in common. To achieve this, term-based partitioning distributes triples to partitions based on the terms they include. However, skewed distribution of Semantic Web data results in unbalanced load distribution. A single peer should be able to handle even the largest partition, and this requirement limits scalability. This approach also suffers from data replication since a triple is sent to multiple partitions. In this paper, we propose a two-step method to overcome above limitations. Our RDFS specific term-based partitioning algorithm applies a selective distribution policy and distributes triples with minimum replication. Our schema-sensitive processing approach eliminates non-productive partitions, and enables processing of a partition regardless of its size. Resulting partitions reach full closure without repeating the global schema or without fix-point iteration as suggested by previous studies.\",\"PeriodicalId\":420849,\"journal\":{\"name\":\"SWIM '13\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"SWIM '13\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2484712.2484719\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"SWIM '13","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2484712.2484719","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

通过三元组的联合项进行RDFS推理;因此，分布式推理方法应该将具有共同术语的三元组组合在一起。为了实现这一点，基于术语的分区根据它们包含的术语将三元组分发到分区。然而，语义Web数据的倾斜分布导致负载分布不平衡。单个对等点应该能够处理最大的分区，而这一需求限制了可伸缩性。这种方法还会受到数据复制的影响，因为三元组被发送到多个分区。在本文中，我们提出了一种两步法来克服上述限制。我们的RDFS特定的基于项的分区算法应用选择性分布策略，并以最小的复制分布三元组。我们的模式敏感处理方法消除了非生产性分区，并支持处理分区，而不管其大小。结果分区在不重复全局模式或不像以前的研究建议的那样进行定点迭代的情况下达到完全闭包。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Overcoming limitations of term-based partitioning for distributed RDFS reasoning

RDFS reasoning is carried out via joint terms of triples; accordingly, a distributed reasoning approach should bring together triples that have terms in common. To achieve this, term-based partitioning distributes triples to partitions based on the terms they include. However, skewed distribution of Semantic Web data results in unbalanced load distribution. A single peer should be able to handle even the largest partition, and this requirement limits scalability. This approach also suffers from data replication since a triple is sent to multiple partitions. In this paper, we propose a two-step method to overcome above limitations. Our RDFS specific term-based partitioning algorithm applies a selective distribution policy and distributes triples with minimum replication. Our schema-sensitive processing approach eliminates non-productive partitions, and enables processing of a partition regardless of its size. Resulting partitions reach full closure without repeating the global schema or without fix-point iteration as suggested by previous studies.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助