首页 > 最新文献

Proceedings 14th International Conference on Data Engineering最新文献

英文 中文
Red Brick Vista/sup TM/: aggregate computation and management 红砖Vista/sup TM/:聚合计算和管理
Pub Date : 1998-02-23 DOI: 10.1109/ICDE.1998.655773
Latha S. Colby, R. Cole, E. Haslam, N. Jazayeri, Galt Johnson, William J. McKenna, L. Schumacher, David Wilhite
Aggregate query processing in large data warehouses is computationally intensive. Precomputation is an approach that can be used to speed up aggregate queries. However, in order to make precomputation a truly viable solution to the aggregate query processing problem, it is important to identify the best set of aggregates to precompute and to use these precomputed aggregates effectively. The Red Brick aggregate computation and management system (Red Brick Vista) provides a complete server integrated solution to these problems.
大型数据仓库中的聚合查询处理是计算密集型的。预计算是一种可以用来加速聚合查询的方法。然而,为了使预计算成为聚合查询处理问题的真正可行的解决方案,重要的是确定要预计算的最佳聚合集,并有效地使用这些预计算的聚合。红砖聚合计算和管理系统(红砖Vista)为这些问题提供了一个完整的服务器集成解决方案。
{"title":"Red Brick Vista/sup TM/: aggregate computation and management","authors":"Latha S. Colby, R. Cole, E. Haslam, N. Jazayeri, Galt Johnson, William J. McKenna, L. Schumacher, David Wilhite","doi":"10.1109/ICDE.1998.655773","DOIUrl":"https://doi.org/10.1109/ICDE.1998.655773","url":null,"abstract":"Aggregate query processing in large data warehouses is computationally intensive. Precomputation is an approach that can be used to speed up aggregate queries. However, in order to make precomputation a truly viable solution to the aggregate query processing problem, it is important to identify the best set of aggregates to precompute and to use these precomputed aggregates effectively. The Red Brick aggregate computation and management system (Red Brick Vista) provides a complete server integrated solution to these problems.","PeriodicalId":264926,"journal":{"name":"Proceedings 14th International Conference on Data Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132162207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Mining optimized association rules with categorical and numeric attributes 挖掘具有分类和数字属性的优化关联规则
Pub Date : 1998-02-23 DOI: 10.1109/ICDE.1998.655813
R. Rastogi, Kyuseok Shim
Association rules are useful for determining correlations between attributes of a relation and have applications in marketing, financial and retail sectors. Furthermore, optimized association rules are an effective way to focus on the most interesting characteristics involving certain attributes. Optimized association rules are permitted to contain uninstantiated attributes and the problem is to determine instantiations such that either the support or confidence of the rule is maximized. We generalize the optimized association rules problem in three ways: (1) association rules are allowed to contain disjunctions over uninstantiated attributes; (2) association rules are permitted to contain an arbitrary number of uninstantiated attributes; and (3) uninstantiated attributes can be either categorical or numeric. Our generalized association rules enable us to extract more useful information about seasonal and local patterns involving multiple attributes. We present effective techniques for pruning the search space when computing optimized association rules for both categorical and numeric attributes. Finally, we report the results of our experiments that indicate that our pruning algorithms are efficient for a large number of uninstantiated attributes, disjunctions and values in the domain of the attributes.
关联规则对于确定关系属性之间的相关性非常有用,在市场营销、金融和零售部门都有应用。此外,优化的关联规则是关注涉及某些属性的最有趣特征的有效方法。优化的关联规则允许包含未实例化的属性,问题在于确定实例化,从而使规则的支持度或置信度最大化。我们将优化后的关联规则问题概括为三种方式:(1)允许关联规则包含非实例化属性上的析取;(2)关联规则允许包含任意数量的未实例化属性;(3)未实例化的属性可以是分类的,也可以是数字的。我们的广义关联规则使我们能够提取更多关于涉及多个属性的季节性和本地模式的有用信息。在计算分类和数值属性的优化关联规则时,我们提出了有效的修剪搜索空间的技术。最后,我们报告了我们的实验结果,表明我们的修剪算法对于属性域中大量未实例化的属性、析取和值是有效的。
{"title":"Mining optimized association rules with categorical and numeric attributes","authors":"R. Rastogi, Kyuseok Shim","doi":"10.1109/ICDE.1998.655813","DOIUrl":"https://doi.org/10.1109/ICDE.1998.655813","url":null,"abstract":"Association rules are useful for determining correlations between attributes of a relation and have applications in marketing, financial and retail sectors. Furthermore, optimized association rules are an effective way to focus on the most interesting characteristics involving certain attributes. Optimized association rules are permitted to contain uninstantiated attributes and the problem is to determine instantiations such that either the support or confidence of the rule is maximized. We generalize the optimized association rules problem in three ways: (1) association rules are allowed to contain disjunctions over uninstantiated attributes; (2) association rules are permitted to contain an arbitrary number of uninstantiated attributes; and (3) uninstantiated attributes can be either categorical or numeric. Our generalized association rules enable us to extract more useful information about seasonal and local patterns involving multiple attributes. We present effective techniques for pruning the search space when computing optimized association rules for both categorical and numeric attributes. Finally, we report the results of our experiments that indicate that our pruning algorithms are efficient for a large number of uninstantiated attributes, disjunctions and values in the domain of the attributes.","PeriodicalId":264926,"journal":{"name":"Proceedings 14th International Conference on Data Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123935349","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 170
Failure handling and coordinated execution of concurrent workflows 故障处理和并发工作流的协调执行
Pub Date : 1998-02-23 DOI: 10.1109/ICDE.1998.655796
M. Kamath, K. Ramamritham
Workflow management systems (WFMSs) coordinate the execution of applications distributed over networks. In WFMSs, data inconsistencies can arise due to: the interaction between steps of concurrent threads within a workflow (intra-workflow coordination); the interaction between steps of concurrent workflows (inter-workflow coordination); and the presence of failures. Since these problems have not received adequate attention, this paper focuses on developing the necessary concepts and infrastructure to handle them. First, to deal with inter- and intra-workflow coordination requirements we have identified a set of high level building blocks. Secondly, to handle failures we propose a novel and pragmatic approach called opportunistic compensation and re-execution that allows a workflow designer to customize workflow recovery from correctness as well as performance perspectives. Thirdly based on these concepts we have designed a workflow specification language that expresses new requirements for workflow executions and implemented a run-time system for managing workflow executions while satisfying the new requirements. These ideas are geared towards improving the modeling and correctness properties offered by WFMSs and making them more robust and flexible.
工作流管理系统(WFMSs)协调分布在网络上的应用程序的执行。在WFMSs中,数据不一致可能由以下原因引起:工作流中并发线程的步骤之间的交互(工作流内部协调);并发工作流各步骤之间的交互(工作流间协调);以及失败的存在。由于这些问题没有得到足够的重视,本文的重点是发展必要的概念和基础设施来处理它们。首先,为了处理工作流程之间和内部的协调需求,我们已经确定了一组高级构建块。其次,为了处理失败,我们提出了一种新颖实用的方法,称为机会补偿和重新执行,它允许工作流设计者从正确性和性能的角度定制工作流恢复。在此基础上,设计了表达工作流执行新需求的工作流规范语言,并实现了满足新需求的工作流执行管理运行时系统。这些想法旨在改进wfms提供的建模和正确性属性,并使它们更加健壮和灵活。
{"title":"Failure handling and coordinated execution of concurrent workflows","authors":"M. Kamath, K. Ramamritham","doi":"10.1109/ICDE.1998.655796","DOIUrl":"https://doi.org/10.1109/ICDE.1998.655796","url":null,"abstract":"Workflow management systems (WFMSs) coordinate the execution of applications distributed over networks. In WFMSs, data inconsistencies can arise due to: the interaction between steps of concurrent threads within a workflow (intra-workflow coordination); the interaction between steps of concurrent workflows (inter-workflow coordination); and the presence of failures. Since these problems have not received adequate attention, this paper focuses on developing the necessary concepts and infrastructure to handle them. First, to deal with inter- and intra-workflow coordination requirements we have identified a set of high level building blocks. Secondly, to handle failures we propose a novel and pragmatic approach called opportunistic compensation and re-execution that allows a workflow designer to customize workflow recovery from correctness as well as performance perspectives. Thirdly based on these concepts we have designed a workflow specification language that expresses new requirements for workflow executions and implemented a run-time system for managing workflow executions while satisfying the new requirements. These ideas are geared towards improving the modeling and correctness properties offered by WFMSs and making them more robust and flexible.","PeriodicalId":264926,"journal":{"name":"Proceedings 14th International Conference on Data Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124796380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 113
Compressing relations and indexes 压缩关系和索引
Pub Date : 1998-02-23 DOI: 10.1109/ICDE.1998.655800
J. Goldstein, R. Ramakrishnan, U. Shaft
We propose a new compression algorithm that is tailored to database applications. It can be applied to a collection of records, and is especially effective for records with many low to medium cardinality fields and numeric fields. In addition, this new technique supports very fast decompression. Promising application domains include decision support systems (DSS), since fact tables, which are by far the largest tables in these applications, contain many low and medium cardinality fields and typically no text fields. Further, our decompression rates are faster than typical disk throughputs for sequential scans; in contrast, gzip is slower. This is important in DSS applications, which often scan large ranges of records. An important distinguishing characteristic of our algorithm, in contrast to compression algorithms proposed earlier, is that we can decompress individual tuples (even individual fields), rather than a full page (or an entire relation) at a time. Also, all the information needed for tuple decompression resides on the same page with the tuple. This means that a page can be stored in the buffer pool and used in compressed form, simplifying the job of the buffer manager and improving memory utilization. Our compression algorithm also improves index structures such as B-trees and R-trees significantly by reducing the number of leaf pages and compressing index entries, which greatly increases the fan-out. We can also use lossy compression on the internal nodes of an index.
我们提出了一种适合数据库应用的新的压缩算法。它可以应用于记录集合,并且对于具有许多低到中等基数字段和数字字段的记录特别有效。此外,这种新技术支持非常快的解压缩。有前途的应用领域包括决策支持系统(DSS),因为事实表(这些应用程序中迄今为止最大的表)包含许多低基数和中等基数字段,通常没有文本字段。此外,我们的解压缩速率比顺序扫描的典型磁盘吞吐量更快;相比之下,gzip要慢一些。这在经常扫描大范围记录的DSS应用程序中很重要。与前面提出的压缩算法相比,我们算法的一个重要区别特征是,我们可以一次解压缩单个元组(甚至单个字段),而不是整个页面(或整个关系)。此外,元组解压缩所需的所有信息都与元组位于同一页面上。这意味着可以将页面存储在缓冲池中并以压缩形式使用,从而简化了缓冲区管理器的工作并提高了内存利用率。我们的压缩算法还通过减少叶页的数量和压缩索引条目来显著改进索引结构,如b树和r树,这大大增加了扇出。我们也可以对索引的内部节点使用有损压缩。
{"title":"Compressing relations and indexes","authors":"J. Goldstein, R. Ramakrishnan, U. Shaft","doi":"10.1109/ICDE.1998.655800","DOIUrl":"https://doi.org/10.1109/ICDE.1998.655800","url":null,"abstract":"We propose a new compression algorithm that is tailored to database applications. It can be applied to a collection of records, and is especially effective for records with many low to medium cardinality fields and numeric fields. In addition, this new technique supports very fast decompression. Promising application domains include decision support systems (DSS), since fact tables, which are by far the largest tables in these applications, contain many low and medium cardinality fields and typically no text fields. Further, our decompression rates are faster than typical disk throughputs for sequential scans; in contrast, gzip is slower. This is important in DSS applications, which often scan large ranges of records. An important distinguishing characteristic of our algorithm, in contrast to compression algorithms proposed earlier, is that we can decompress individual tuples (even individual fields), rather than a full page (or an entire relation) at a time. Also, all the information needed for tuple decompression resides on the same page with the tuple. This means that a page can be stored in the buffer pool and used in compressed form, simplifying the job of the buffer manager and improving memory utilization. Our compression algorithm also improves index structures such as B-trees and R-trees significantly by reducing the number of leaf pages and compressing index entries, which greatly increases the fan-out. We can also use lossy compression on the internal nodes of an index.","PeriodicalId":264926,"journal":{"name":"Proceedings 14th International Conference on Data Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130030753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 229
Generalizing "search" in generalized search trees 在广义搜索树中泛化“搜索”
Pub Date : 1998-02-23 DOI: 10.1109/ICDE.1998.655801
Paul M. Aoki
The generalized search tree, or GiST, defines a framework of basic interfaces required to construct a hierarchical access method for database systems. As originally specified, GiST only supports record selection. We show how a small number of additional interfaces enable GiST to support a much larger class of operations. Members of this class, which includes, nearest-neighbor and ranked search, user-defined aggregation and index-assisted selectivity estimation, are increasingly common in new database applications. The advantages of implementing these operations in the GiST framework include reduction of user development effort and the ability to use industrial strength concurrency and recovery mechanisms provided by expert implementers.
广义搜索树(GiST)定义了为数据库系统构建分层访问方法所需的基本接口框架。正如最初指定的,GiST只支持记录选择。我们将展示少量附加接口如何使GiST支持更大的操作类。这类的成员,包括最近邻和排名搜索、用户定义聚合和索引辅助选择性估计,在新的数据库应用程序中越来越常见。在GiST框架中实现这些操作的优点包括减少用户开发工作和使用专家实现者提供的工业强度并发性和恢复机制的能力。
{"title":"Generalizing \"search\" in generalized search trees","authors":"Paul M. Aoki","doi":"10.1109/ICDE.1998.655801","DOIUrl":"https://doi.org/10.1109/ICDE.1998.655801","url":null,"abstract":"The generalized search tree, or GiST, defines a framework of basic interfaces required to construct a hierarchical access method for database systems. As originally specified, GiST only supports record selection. We show how a small number of additional interfaces enable GiST to support a much larger class of operations. Members of this class, which includes, nearest-neighbor and ranked search, user-defined aggregation and index-assisted selectivity estimation, are increasingly common in new database applications. The advantages of implementing these operations in the GiST framework include reduction of user development effort and the ability to use industrial strength concurrency and recovery mechanisms provided by expert implementers.","PeriodicalId":264926,"journal":{"name":"Proceedings 14th International Conference on Data Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128897742","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 59
Graph structured views and their incremental maintenance 图结构化视图及其增量维护
Pub Date : 1998-02-23 DOI: 10.1109/ICDE.1998.655767
Yue Zhuge, H. Garcia-Molina
Studies the problem of maintaining materialized views of graph structured data. The base data consists of records containing identifiers of other records. The data could represent traditional objects (with methods, attributes and a class hierarchy), but it could also represent a lower-level data structure. We define simple views and materialized views for such graph structured data, analyzing options for representing record identity and references in the view. We develop incremental maintenance algorithms for these views.
研究了图结构数据实体化视图的维护问题。基本数据由包含其他记录标识符的记录组成。数据可以表示传统的对象(带有方法、属性和类层次结构),但也可以表示较低级别的数据结构。我们为这样的图形结构化数据定义了简单视图和物化视图,分析了在视图中表示记录标识和引用的选项。我们为这些视图开发了增量维护算法。
{"title":"Graph structured views and their incremental maintenance","authors":"Yue Zhuge, H. Garcia-Molina","doi":"10.1109/ICDE.1998.655767","DOIUrl":"https://doi.org/10.1109/ICDE.1998.655767","url":null,"abstract":"Studies the problem of maintaining materialized views of graph structured data. The base data consists of records containing identifiers of other records. The data could represent traditional objects (with methods, attributes and a class hierarchy), but it could also represent a lower-level data structure. We define simple views and materialized views for such graph structured data, analyzing options for representing record identity and references in the view. We develop incremental maintenance algorithms for these views.","PeriodicalId":264926,"journal":{"name":"Proceedings 14th International Conference on Data Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131066536","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 122
Flattening an object algebra to provide performance 将对象代数扁平化以提供性能
Pub Date : 1998-02-23 DOI: 10.1109/ICDE.1998.655820
P. Boncz, A. N. Wilschut, M. Kersten
Algebraic transformation and optimization techniques have been the method of choice in relational query execution, but applying them in object-oriented (OO) DBMSs is difficult due to the complexity of OO query languages. This paper demonstrates that the problem can be simplified by mapping an OO data model to the binary relational model implemented by Monet, a state-of-the-art database kernel. We present a generic mapping scheme to flatten data models and study the case of straightforward OO model. We show how flattening enabled us to implement a query algebra, using only a very limited set of simple operations. The required primitives and query execution strategies are discussed, and their performance is evaluated on the 1-GByte TPC-D (Transaction-processing Performance Council's Benchmark D), showing that our divide-and-conquer approach yields excellent results.
代数转换和优化技术一直是关系查询执行的首选方法,但由于面向对象查询语言的复杂性,在面向对象(OO) dbms中应用它们很困难。本文证明,可以通过将OO数据模型映射到由最先进的数据库内核Monet实现的二进制关系模型来简化这个问题。我们提出了一种通用的映射方案来平面化数据模型,并研究了直接面向对象模型的情况。我们将展示扁平化如何使我们能够仅使用一组非常有限的简单操作来实现查询代数。讨论了所需的原语和查询执行策略,并在1-GByte TPC-D(事务处理性能委员会的基准测试D)上评估了它们的性能,表明我们的分而治之方法产生了出色的结果。
{"title":"Flattening an object algebra to provide performance","authors":"P. Boncz, A. N. Wilschut, M. Kersten","doi":"10.1109/ICDE.1998.655820","DOIUrl":"https://doi.org/10.1109/ICDE.1998.655820","url":null,"abstract":"Algebraic transformation and optimization techniques have been the method of choice in relational query execution, but applying them in object-oriented (OO) DBMSs is difficult due to the complexity of OO query languages. This paper demonstrates that the problem can be simplified by mapping an OO data model to the binary relational model implemented by Monet, a state-of-the-art database kernel. We present a generic mapping scheme to flatten data models and study the case of straightforward OO model. We show how flattening enabled us to implement a query algebra, using only a very limited set of simple operations. The required primitives and query execution strategies are discussed, and their performance is evaluated on the 1-GByte TPC-D (Transaction-processing Performance Council's Benchmark D), showing that our divide-and-conquer approach yields excellent results.","PeriodicalId":264926,"journal":{"name":"Proceedings 14th International Conference on Data Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132023736","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 84
Optimizing regular path expressions using graph schemas 使用图模式优化正则路径表达式
Pub Date : 1998-02-23 DOI: 10.1109/ICDE.1998.655753
M. Fernández, Dan Suciu
Query languages for data with irregular structure use regular path expressions for navigation. This feature is useful for querying data where parts of the structure is either unknown, unavailable to the user, or changes frequently. Naive execution of regular path expressions is inefficient however, because it ignores any structure in the data. We describe two optimization techniques for queries with regular path expressions. Both rely on graph schemas for specifying partial knowledge about the data's structure. Query pruning uses this structure to restrict navigation to only a fragment of the data; we give an efficient algorithm for rewriting any regular path expression query into a pruned one. Query rewriting using state extents can eliminate or reduce navigation altogether; it is reminiscent of optimizing relational queries using indices. There may be several ways to optimize a query using state extents; we give a polynomial space algorithm that finds all such optimizations. For restricted forms of regular path expressions, the algorithm is provably efficient. We also give an efficient approximation algorithm that works on all regular path expressions.
对于不规则结构的数据,查询语言使用正则路径表达式进行导航。此特性对于查询结构的某些部分未知、用户不可用或经常更改的数据非常有用。然而,单纯地执行正则路径表达式是低效的,因为它忽略了数据中的任何结构。我们描述了使用正则路径表达式查询的两种优化技术。两者都依赖于图模式来指定关于数据结构的部分知识。查询修剪使用这种结构将导航限制为数据的一个片段;我们给出了一种将任意正则路径表达式查询重写为精简查询的有效算法。使用状态范围重写查询可以完全消除或减少导航;这让人想起使用索引优化关系查询。可能有几种方法可以使用状态区来优化查询;我们给出了一个多项式空间算法来找到所有这样的优化。对于正则路径表达式的限制形式,证明了该算法的有效性。我们还给出了一个有效的近似算法,适用于所有正则路径表达式。
{"title":"Optimizing regular path expressions using graph schemas","authors":"M. Fernández, Dan Suciu","doi":"10.1109/ICDE.1998.655753","DOIUrl":"https://doi.org/10.1109/ICDE.1998.655753","url":null,"abstract":"Query languages for data with irregular structure use regular path expressions for navigation. This feature is useful for querying data where parts of the structure is either unknown, unavailable to the user, or changes frequently. Naive execution of regular path expressions is inefficient however, because it ignores any structure in the data. We describe two optimization techniques for queries with regular path expressions. Both rely on graph schemas for specifying partial knowledge about the data's structure. Query pruning uses this structure to restrict navigation to only a fragment of the data; we give an efficient algorithm for rewriting any regular path expression query into a pruned one. Query rewriting using state extents can eliminate or reduce navigation altogether; it is reminiscent of optimizing relational queries using indices. There may be several ways to optimize a query using state extents; we give a polynomial space algorithm that finds all such optimizations. For restricted forms of regular path expressions, the algorithm is provably efficient. We also give an efficient approximation algorithm that works on all regular path expressions.","PeriodicalId":264926,"journal":{"name":"Proceedings 14th International Conference on Data Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115350915","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 286
Back to the future: dynamic hierarchical clustering 回到未来:动态分层集群
Pub Date : 1998-02-23 DOI: 10.1109/ICDE.1998.655821
Chendong Zou, B. Salzberg, R. Ladin
Describes a new method for dynamically clustering hierarchical data which maintains good clustering within disk pages in the presence of insertions and deletions. This simple but effective method, which we call Enc, encodes the insertion order of children with respect to their parents and concatenates the insertion numbers to form a compact key for the data. This compact key is stored only in the indexing structure and does not affect the logical database schema. Experimental results show that our Enc method is very efficient for hierarchical queries and performs reasonably well for random access queries.
描述一种动态聚类分层数据的新方法,该方法在存在插入和删除的情况下在磁盘页内保持良好的聚类。这种简单而有效的方法,我们称之为Enc,它对子节点相对于父节点的插入顺序进行编码,并将插入编号连接起来,形成数据的紧凑键。此压缩键仅存储在索引结构中,不影响逻辑数据库模式。实验结果表明,我们的Enc方法对分层查询非常有效,对随机访问查询也有相当好的性能。
{"title":"Back to the future: dynamic hierarchical clustering","authors":"Chendong Zou, B. Salzberg, R. Ladin","doi":"10.1109/ICDE.1998.655821","DOIUrl":"https://doi.org/10.1109/ICDE.1998.655821","url":null,"abstract":"Describes a new method for dynamically clustering hierarchical data which maintains good clustering within disk pages in the presence of insertions and deletions. This simple but effective method, which we call Enc, encodes the insertion order of children with respect to their parents and concatenates the insertion numbers to form a compact key for the data. This compact key is stored only in the indexing structure and does not affect the logical database schema. Experimental results show that our Enc method is very efficient for hierarchical queries and performs reasonably well for random access queries.","PeriodicalId":264926,"journal":{"name":"Proceedings 14th International Conference on Data Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115693512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Fuzzy triggers: incorporating imprecise reasoning into active databases 模糊触发器:将不精确的推理纳入活动数据库
Pub Date : 1998-02-23 DOI: 10.1109/ICDE.1998.655766
A. Wolski, T. Bouaziz
Traditional event-condition-action triggers (active database rules) include a Boolean predicate as a trigger condition. We propose fuzzy triggers whereby fuzzy inference is utilized in the condition evaluation. In this way, approximate reasoning may be integrated with a traditional crisp database. The new approach paves the way for intuitive expression of application semantics of imprecise nature, in database-bound applications. Two fuzzy trigger models are proposed. Firstly, a set of fuzzy rules is encapsulated into a Boolean-valued function called a rule set function, leading to the C-fuzzy trigger model. Subsequently, actions are expressed also in fuzzy terms, and the corresponding CA-fuzzy trigger model is proposed. Examples are provided to illustrate how fuzzy triggers can be applied to a real-life drive control system in an industrial installation.
传统的事件-条件-操作触发器(活动数据库规则)包括一个布尔谓词作为触发条件。我们提出模糊触发器,在条件评估中利用模糊推理。通过这种方式,近似推理可以与传统的crisp数据库相结合。这种新方法为在数据库绑定的应用程序中直观地表达不精确的应用程序语义铺平了道路。提出了两种模糊触发模型。首先,将一组模糊规则封装到一个称为规则集函数的布尔值函数中,从而得到C-fuzzy触发器模型。随后,将动作也用模糊项表示,并提出相应的ca -模糊触发模型。举例说明如何模糊触发器可以应用于现实生活中的驱动控制系统在工业装置。
{"title":"Fuzzy triggers: incorporating imprecise reasoning into active databases","authors":"A. Wolski, T. Bouaziz","doi":"10.1109/ICDE.1998.655766","DOIUrl":"https://doi.org/10.1109/ICDE.1998.655766","url":null,"abstract":"Traditional event-condition-action triggers (active database rules) include a Boolean predicate as a trigger condition. We propose fuzzy triggers whereby fuzzy inference is utilized in the condition evaluation. In this way, approximate reasoning may be integrated with a traditional crisp database. The new approach paves the way for intuitive expression of application semantics of imprecise nature, in database-bound applications. Two fuzzy trigger models are proposed. Firstly, a set of fuzzy rules is encapsulated into a Boolean-valued function called a rule set function, leading to the C-fuzzy trigger model. Subsequently, actions are expressed also in fuzzy terms, and the corresponding CA-fuzzy trigger model is proposed. Examples are provided to illustrate how fuzzy triggers can be applied to a real-life drive control system in an industrial installation.","PeriodicalId":264926,"journal":{"name":"Proceedings 14th International Conference on Data Engineering","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"1998-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116584881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
期刊
Proceedings 14th International Conference on Data Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1