{"title":"High Performance RDF Updates with TripleBit +","authors":"Pingpeng Yuan, Lijian Fan, Hai Jin","doi":"10.1109/ICDIM.2018.8847004","DOIUrl":null,"url":null,"abstract":"The volume of RDF data continues to grow over the past decade and many known RDF datasets have billions of triples. A grant challenge of managing this huge RDF data is how to access this big RDF data efficiently. A popular approach to addressing the problem is to build a full set of permutations of (S, P, O) indexes. Although this approach has shown to accelerate joins by orders of magnitude, the large space overhead limits the scalability of this approach and makes it heavyweight. In this paper, we present TripleBit +, a fast and compact system for updating RDF data. The design of TripleBit + has two salient features. First, the efficient maintenance strategies of TripleBit + reduces both the overhead to update data and indexes. Second, effective maintenance technologies to handle online updates over RDF repositories are proposed. Our experiments show that TripleBit + outperforms RDF-3X, MonetDB, BitMat on LUBM, UniProt, and BTC 2012 benchmark queries and it offers orders of mangnitude performance improvement for some complex join queries. Our design also yields high task rates as high as 660,000 per second and fast average response time of task which is faster than x-RDF-3X and PostgreSQL.","PeriodicalId":120884,"journal":{"name":"2018 Thirteenth International Conference on Digital Information Management (ICDIM)","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Thirteenth International Conference on Digital Information Management (ICDIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDIM.2018.8847004","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The volume of RDF data continues to grow over the past decade and many known RDF datasets have billions of triples. A grant challenge of managing this huge RDF data is how to access this big RDF data efficiently. A popular approach to addressing the problem is to build a full set of permutations of (S, P, O) indexes. Although this approach has shown to accelerate joins by orders of magnitude, the large space overhead limits the scalability of this approach and makes it heavyweight. In this paper, we present TripleBit +, a fast and compact system for updating RDF data. The design of TripleBit + has two salient features. First, the efficient maintenance strategies of TripleBit + reduces both the overhead to update data and indexes. Second, effective maintenance technologies to handle online updates over RDF repositories are proposed. Our experiments show that TripleBit + outperforms RDF-3X, MonetDB, BitMat on LUBM, UniProt, and BTC 2012 benchmark queries and it offers orders of mangnitude performance improvement for some complex join queries. Our design also yields high task rates as high as 660,000 per second and fast average response time of task which is faster than x-RDF-3X and PostgreSQL.