22nd International Conference on Data Engineering (ICDE'06)最新文献

英文中文

Estimating XML Structural Join Size Quickly and Economically 快速经济地估计XML结构连接大小

22nd International Conference on Data Engineering (ICDE'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDE.2006.63

Cheng Luo, Zhewei Jiang, W. Hou, Feng Yan, Chih-Fang Wang

XML structural joins, which evaluate the containment (ancestor-descendant) relationships between XML elements, are important operations of XML query processing. Estimating structural join size accurately and quickly is thus crucial to the success of XML query plan selection and the query optimization. XML structural joins are essentially complex unequal joins, which render well-known estimation techniques, such as cosine transform, wavelet transform, and sketch, not directly applicable. In this paper, we propose a relation model to capture the structural information of XML data such that the original complex unequal joins are converted to equal joins and those well-known estimation techniques become directly applicable to structural join size estimation. Theoretical analyses and extensive experiments have been performed on these estimation methods. It is shown that the cosine transform requires the least memory and yields the best estimates.

XML结构连接评估XML元素之间的包含(祖先-后代)关系，是XML查询处理的重要操作。因此，准确、快速地估计结构连接大小对于XML查询计划选择和查询优化的成功至关重要。XML结构连接本质上是复杂的不等连接，这使得众所周知的估计技术(如余弦变换、小波变换和草图)不能直接应用。本文提出了一种关系模型来捕获XML数据的结构信息，从而将原来复杂的不等连接转换为相等连接，使那些著名的估计技术直接适用于结构连接大小的估计。对这些估计方法进行了理论分析和大量的实验。结果表明，余弦变换需要最少的内存，并产生最好的估计。

引用次数: 8

Experiment Management with Metadata-based Integration for Collaborative Scientific Research 基于元数据集成的协同科研实验管理

22nd International Conference on Data Engineering (ICDE'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDE.2006.65

Fusheng Wang, Peiya Liu, John Pearson, F. Azar, G. Madlmayr

Scientific research in many fields is increasingly a collaborative effort across multiple institutions and disciplines. Scientific researchers need not only an effective system to manage their data, results, and the experiments that generate the results, but also a platform to integrate, share and search these across multiple institutions. Therefore, researchers are able to reuse experiments, pool expertise and validate approaches. In this paper, we present Sci- Port, a system of experiment management and integration for collaborative scientific research. SciPort’s architecture uses i) a general transformation-based data model to represent and link experiment processes; ii) hierarchical data classification across multiple institutions according to research programs’ goals and organization; iii) metadatacentric representation that concisely captures the context of experiments; and iv) virtual data integration through centralized metadata integration. The system is built for open source, and the metadata-based representation and integration provides a unified framework and tool set to manage and share experiments for scientific research communities.

许多领域的科学研究越来越需要多个机构和学科的合作。科研人员不仅需要一个有效的系统来管理他们的数据、结果和产生结果的实验，还需要一个平台来跨多个机构整合、共享和搜索这些数据。因此，研究人员能够重用实验，汇集专业知识和验证方法。本文提出了一个协作科研实验管理与集成系统Sci- Port。SciPort的架构使用i)一个通用的基于转换的数据模型来表示和链接实验过程;Ii)根据研究项目目标和组织在多个机构之间进行分层数据分类;Iii)元数据中心表示，简洁地捕捉实验背景;iv)通过集中元数据集成实现虚拟数据集成。系统面向开源，基于元数据的表示和集成为科研团体提供了统一的实验管理和共享框架和工具集。

{"title":"Experiment Management with Metadata-based Integration for Collaborative Scientific Research","authors":"Fusheng Wang, Peiya Liu, John Pearson, F. Azar, G. Madlmayr","doi":"10.1109/ICDE.2006.65","DOIUrl":"https://doi.org/10.1109/ICDE.2006.65","url":null,"abstract":"Scientific research in many fields is increasingly a collaborative effort across multiple institutions and disciplines. Scientific researchers need not only an effective system to manage their data, results, and the experiments that generate the results, but also a platform to integrate, share and search these across multiple institutions. Therefore, researchers are able to reuse experiments, pool expertise and validate approaches. In this paper, we present Sci- Port, a system of experiment management and integration for collaborative scientific research. SciPort’s architecture uses i) a general transformation-based data model to represent and link experiment processes; ii) hierarchical data classification across multiple institutions according to research programs’ goals and organization; iii) metadatacentric representation that concisely captures the context of experiments; and iv) virtual data integration through centralized metadata integration. The system is built for open source, and the metadata-based representation and integration provides a unified framework and tool set to manage and share experiments for scientific research communities.","PeriodicalId":6819,"journal":{"name":"22nd International Conference on Data Engineering (ICDE'06)","volume":"14 1","pages":"96-96"},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88470297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 13

ConQueSt: a Constraint-based Querying System for Exploratory Pattern Discovery 征服:一种基于约束的探索性模式发现查询系统

22nd International Conference on Data Engineering (ICDE'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDE.2006.42

F. Bonchi, F. Giannotti, C. Lucchese, S. Orlando, R. Perego, R. Trasarti

ConQueSt is a constraint-based querying system devised with the aim of supporting the intrinsically exploratory nature of pattern discovery. It provides users with an expressive constraint-based query language which allows the discovery process to be effectively driven toward potentially interesting patterns. Constraints are also exploited to reduce the cost of pattern mining. The system is built around an efficient constraint-based mining engine which entails several data and search space reduction techniques, and allows new user-defined constraints to be easily added.

ConQueSt是一个基于约束的查询系统，旨在支持模式发现的内在探索性。它为用户提供了一种富有表现力的基于约束的查询语言，允许将发现过程有效地导向潜在的有趣模式。约束也被用来降低模式挖掘的成本。该系统围绕一个高效的基于约束的挖掘引擎构建，该引擎需要几种数据和搜索空间缩减技术，并允许轻松添加新的用户定义约束。

引用次数: 54

An Efficient XPath Query Processor for XML Streams 用于XML流的高效XPath查询处理器

22nd International Conference on Data Engineering (ICDE'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDE.2006.18

Yi Chen, S. Davidson, Yifeng Zheng

Streaming XPath evaluation algorithms must record a potentially exponential number of pattern matches when both predicates and descendant axes are present in queries, and the XML data is recursive. In this paper, we use a compact data structure to encode these pattern matches rather than storing them explicitly. We then propose a polynomial time streaming algorithm to evaluate XPath queries by probing the data structure in a lazy fashion. Extensive experiments show that our approach not only has a good theoretical complexity bound but is also efficient in practice.

如果查询中同时存在谓词和后代轴，并且XML数据是递归的，那么流XPath求值算法必须记录模式匹配的潜在指数数量。在本文中，我们使用紧凑的数据结构来编码这些模式匹配，而不是显式地存储它们。然后，我们提出一个多项式时间流算法，通过以惰性方式探测数据结构来评估XPath查询。大量的实验表明，我们的方法不仅具有良好的理论复杂度界限，而且在实践中是有效的。

引用次数: 115

U-Filter: A Lightweight XML View Update Checker 一个轻量级的XML视图更新检查器

22nd International Conference on Data Engineering (ICDE'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDE.2006.163

Ling Wang, Elke A. Rundensteiner, Murali Mani

Both XML-relational systems and native XML systems support creating XML wrapper views and querying against them. However, update operations against such virtual XML views in most cases are not supported yet.

XML关系系统和原生XML系统都支持创建XML包装器视图并对其进行查询。但是，在大多数情况下，还不支持针对此类虚拟XML视图的更新操作。

引用次数: 10

MAPLE: A Mobile Scalable P2P Nearest Neighbor Query System for Location-based Services 基于位置服务的移动可扩展P2P最近邻查询系统

22nd International Conference on Data Engineering (ICDE'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDE.2006.89

Wei-Shinn Ku, Roger Zimmermann, C. Wan, Haojun Wang

In this demonstration we present MAPLE, a scalable peer-to-peer nearest neighbor (NN) query system for mobile environments. MAPLE is designed for the efficient sharing of query results cached in the local storage of mobile peers. The MAPLE system is innovative in its ability to either fully or partially compute location-dependent nearest neighbor objects on each host. The demonstration illustrates how cooperative data sharing and distributed processing among mobile peers results in a considerable reduction of the load on remote spatial databases.

在这个演示中，我们展示了MAPLE，一个用于移动环境的可扩展的点对点最近邻(NN)查询系统。MAPLE是为高效地共享缓存在移动对等体本地存储中的查询结果而设计的。MAPLE系统在其完全或部分计算每个主机上与位置相关的最近邻对象的能力方面具有创新性。该演示演示了移动对等点之间的协作数据共享和分布式处理如何显著减少远程空间数据库的负载。

引用次数: 14

Mining Shifting-and-Scaling Co-Regulation Patterns on Gene Expression Profiles 挖掘基因表达谱的移位和缩放协同调控模式

22nd International Conference on Data Engineering (ICDE'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDE.2006.98

Xin Xu, Ying Lu, A. Tung, Wei Wang

In this paper, we propose a new model for coherent clustering of gene expression data called reg-cluster. The proposed model allows (1) the expression profiles of genes in a cluster to follow any shifting-and-scaling patterns in subspace, where the scaling can be either positive or negative, and (2) the expression value changes across any two conditions of the cluster to be significant. No previous work measures up to the task that we have set: the density-based subspace clustering algorithms require genes to have similar expression levels to each other in subspace; the pattern-based biclustering algorithms only allow pure shifting or pure scaling patterns; and the tendency-based biclustering algorithms have no coherence guarantees. We also develop a novel patternbased biclustering algorithm for identifying shifting-andscaling co-regulation patterns, satisfying both coherence constraint and regulation constraint. Our experimental results show that the reg-cluster algorithm is able to detect a significant amount of clusters missed by previous models, and these clusters are potentially of high biological significance.

在本文中，我们提出了一种新的基因表达数据的相干聚类模型，称为reg-cluster。所提出的模型允许(1)集群中基因的表达谱在子空间中遵循任何移动和缩放模式，其中缩放可以是正的或负的;(2)在集群的任何两种条件下表达值的变化都是显著的。以前的工作没有达到我们设定的任务:基于密度的子空间聚类算法要求基因在子空间中具有相似的表达水平;基于模式的双聚类算法只允许纯移动或纯缩放模式;基于趋势的双聚类算法没有一致性保证。我们还开发了一种新的基于模式的双聚类算法，用于识别移动和缩放的共调节模式，同时满足相干约束和调节约束。我们的实验结果表明，reg-cluster算法能够检测到之前模型遗漏的大量聚类，这些聚类可能具有很高的生物学意义。

引用次数: 57

Integrating Unstructured Data into Relational Databases 将非结构化数据集成到关系数据库中

22nd International Conference on Data Engineering (ICDE'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDE.2006.83

I. Mansuri, Sunita Sarawagi

In this paper we present a system for automatically integrating unstructured text into a multi-relational database using state-of-the-art statistical models for structure extraction and matching. We show how to extend current highperforming models, Conditional Random Fields and their semi-markov counterparts, to effectively exploit a variety of recognition clues available in a database of entities, thereby significantly reducing the dependence on manually labeled training data. Our system is designed to load unstructured records into columns spread across multiple tables in the database while resolving the relationship of the extracted text with existing column values, and preserving the cardinality and link constraints of the database. We show how to combine the inference algorithms of statistical models with the database imposed constraints for optimal data integration.

在本文中，我们提出了一个系统，用于自动集成非结构化文本到一个多关系数据库中，使用最先进的统计模型进行结构提取和匹配。我们展示了如何扩展当前高性能模型，条件随机场及其半马尔可夫对应物，以有效地利用实体数据库中可用的各种识别线索，从而显着减少对手动标记训练数据的依赖。我们的系统旨在将非结构化记录加载到数据库中分布在多个表中的列中，同时解决提取的文本与现有列值的关系，并保留数据库的基数和链接约束。我们展示了如何将统计模型的推理算法与数据库强加的约束相结合，以实现最佳数据集成。

引用次数: 123

Practical Adaptation to Changing Resources in Grid Query Processing 网格查询处理中对资源变化的实际适应

22nd International Conference on Data Engineering (ICDE'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDE.2006.113

A. Gounaris, N. Paton, R. Sakellariou, A. Fernandes, Jim Smith, P. Watson

Grid computational resources, as well as being heterogeneous, may also exhibit unpredictable, volatile behaviour. Therefore, query processing on the Grid needs to be adaptive in order to cope with evolving resource characteristics, such as machine load and availability. To address this challenge in a Grid environment, the non-adaptive OGSA-DQP1 system described in [1] has been enhanced with adaptive capabilities.

网格计算资源，以及异构性，也可能表现出不可预测的、易变的行为。因此，网格上的查询处理需要自适应，以应对不断变化的资源特征，如机器负载和可用性。为了应对网格环境中的这一挑战，[1]中描述的非自适应OGSA-DQP1系统已经增强了自适应能力。

引用次数: 18

ProcessingWindow Queries in Wireless Sensor Networks 无线传感器网络中的窗口查询处理

22nd International Conference on Data Engineering (ICDE'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDE.2006.119

Yingqi Xu, Wang-Chien Lee, Jianliang Xu, Gail Mitchell

The existing query processing techniques for sensor networks rely on a network infrastructure for query propagation and data collection. However, such an infrastructure is very susceptible to network topology transients that widely exist in sensor networks. In this paper, we propose an infrastructure-free window query processing technique for sensor networks, called itinerary-based window query execution (IWQE), in which query propagation and data collection are combined into one single stage and executed along a well-designed itinerary inside a query window. We study the parameters for setting up an itinerary (e.g., width and route) and incorporate into IWQE three data collection schemes based on different performance trade-offs. Finally we demonstrate, by extensive simulations, the superior energy-time efficiency, robustness, and accuracy of IWQE over the current state-of-the-art techniques in supporting window queries under various network conditions.

现有的传感器网络查询处理技术依赖于网络基础设施来进行查询传播和数据收集。然而，这种基础结构非常容易受到传感器网络中广泛存在的网络拓扑暂态的影响。在本文中，我们提出了一种无基础设施的传感器网络窗口查询处理技术，称为基于行程的窗口查询执行(IWQE)，该技术将查询传播和数据收集结合到一个阶段，并在查询窗口内沿着精心设计的行程执行。我们研究了设置行程的参数(例如，宽度和路线)，并将基于不同性能权衡的三种数据收集方案纳入IWQE。最后，通过广泛的模拟，我们证明了IWQE在支持各种网络条件下的窗口查询方面比当前最先进的技术具有更高的能量时间效率、鲁棒性和准确性。

引用次数: 34

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

22nd International Conference on Data Engineering (ICDE'06)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀