Structural joins: a primitive for efficient XML query pattern matching

Proceedings 18th International Conference on Data Engineering Pub Date : 2002-08-07 DOI:10.1109/ICDE.2002.994704

S. Al-Khalifa, H. Jagadish, Nick Koudas, J. Patel, D. Srivastava, Yuqing Wu

{"title":"Structural joins: a primitive for efficient XML query pattern matching","authors":"S. Al-Khalifa, H. Jagadish, Nick Koudas, J. Patel, D. Srivastava, Yuqing Wu","doi":"10.1109/ICDE.2002.994704","DOIUrl":null,"url":null,"abstract":"XML queries typically specify patterns of selection predicates on multiple elements that have some specified tree structured relationships. The primitive tree structured relationships are parent-child and ancestor-descendant, and finding all occurrences of these relationships in an XML database is a core operation for XML query processing. We develop two families of structural join algorithms for this task: tree-merge and stack-tree. The tree-merge algorithms are a natural extension of traditional merge joins and the multi-predicate merge joins, while the stack-tree algorithms have no counterpart in traditional relational join processing. We present experimental results on a range of data and queries using the TIMBER native XML query engine built on top of SHORE. We show that while, in some cases, tree-merge algorithms can have performance comparable to stack-tree algorithms, in many cases they are considerably worse. This behavior is explained by analytical results that demonstrate that, on sorted inputs, the stack-tree algorithms have worst-case I/O and CPU complexities linear in the sum of the sizes of inputs and output, while the tree-merge algorithms do not have the same guarantee.","PeriodicalId":191529,"journal":{"name":"Proceedings 18th International Conference on Data Engineering","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"890","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 18th International Conference on Data Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDE.2002.994704","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 890

Abstract

XML queries typically specify patterns of selection predicates on multiple elements that have some specified tree structured relationships. The primitive tree structured relationships are parent-child and ancestor-descendant, and finding all occurrences of these relationships in an XML database is a core operation for XML query processing. We develop two families of structural join algorithms for this task: tree-merge and stack-tree. The tree-merge algorithms are a natural extension of traditional merge joins and the multi-predicate merge joins, while the stack-tree algorithms have no counterpart in traditional relational join processing. We present experimental results on a range of data and queries using the TIMBER native XML query engine built on top of SHORE. We show that while, in some cases, tree-merge algorithms can have performance comparable to stack-tree algorithms, in many cases they are considerably worse. This behavior is explained by analytical results that demonstrate that, on sorted inputs, the stack-tree algorithms have worst-case I/O and CPU complexities linear in the sum of the sizes of inputs and output, while the tree-merge algorithms do not have the same guarantee.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

结构连接:用于高效XML查询模式匹配的原语

XML查询通常在具有某些指定的树结构关系的多个元素上指定选择谓词的模式。原始树结构关系是父子关系和祖先-后代关系，在XML数据库中查找这些关系的所有出现是XML查询处理的核心操作。我们为此开发了两类结构连接算法:树合并和堆栈树。树合并算法是传统合并连接和多谓词合并连接的自然扩展，而堆栈树算法在传统的关系连接处理中没有对应的算法。我们使用构建在SHORE之上的TIMBER原生XML查询引擎，给出了一系列数据和查询的实验结果。我们表明，虽然在某些情况下，树合并算法可以具有与堆栈树算法相当的性能，但在许多情况下，它们要差得多。分析结果解释了这种行为，分析结果表明，在排序输入上，堆栈树算法的最坏情况I/O和CPU复杂性在输入和输出大小的总和中呈线性，而树合并算法没有相同的保证。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings 18th International Conference on Data Engineering

自引率

0.00%

发文量

期刊最新文献

Out from under the trees [linear file template] Declarative composition and peer-to-peer provisioning of dynamic Web services Multivariate time series prediction via temporal classification Integrating workflow management systems with business-to-business interaction standards YFilter: efficient and scalable filtering of XML documents