22nd International Conference on Data Engineering Workshops (ICDEW'06)最新文献

英文中文

New Functions of File Systems to Manage Information Shared by Communities 文件系统管理社区共享信息的新功能

22nd International Conference on Data Engineering Workshops (ICDEW'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.98

Ken'ichi Ishikawa, Atsuyuki Morishima, S. Sugimoto

Today, more and more people in knowledge communities, like research laboratories, use shared file servers to store and share their information. People in such communities often work together and their files stored in a file server have relationships with each other. Information on the relationships is usually exchanged offline and used implicitly to facilitate the management and sharing of the files. This paper proposes new functions to manage and use the relationships to make various views on the file servers. The functions provide a high-level support and are compatible with the operational framework of existing file systems.

今天，越来越多的人在知识社区，如研究实验室，使用共享文件服务器来存储和共享他们的信息。这些社区中的人们经常一起工作，他们存储在文件服务器中的文件彼此之间存在关系。有关关系的信息通常离线交换，并隐式地用于促进文件的管理和共享。本文提出了新的功能来管理和使用文件服务器上的各种视图的关系。这些功能提供了高级支持，并与现有文件系统的操作框架兼容。

引用次数: 0

A Peer-to-Peer Architecture to Enable Versatile Lookup System Design 实现多用途查找系统设计的点对点架构

22nd International Conference on Data Engineering Workshops (ICDEW'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.17

Vivek Sawant, J. Kaur

The resource lookup requirements in applications such as web caching, web content search, content distribution, resource sharing, network monitoring and management, and e-commerce have caught the attention of peer-to-peer (P2P) distributed systems researchers. Over the past few years, several decentralized P2P lookup system designs have been proposed for addressing these requirements. Most of these early designs are targeted at specific applications. Unfortunately, the variations in the operating environments and lookup characteristics across applications restricts the applicability of such specialized designs. In this paper, we present an architecture for P2P systems that identifies the functions necessary for designing resource lookup systems with wide applicability. We demonstrate the usefulness of the functions included in the architecture by illustrating their use in developing diverse lookup techniques.

web缓存、web内容搜索、内容分发、资源共享、网络监控和管理以及电子商务等应用中的资源查找需求引起了点对点分布式系统研究者的关注。在过去的几年中，为了满足这些需求，已经提出了几种分散的P2P查找系统设计。这些早期的设计大多针对特定的应用。不幸的是，操作环境和跨应用程序查找特性的变化限制了这种专门设计的适用性。在本文中，我们提出了一个P2P系统的体系结构，该体系结构确定了设计具有广泛适用性的资源查找系统所需的功能。我们通过说明在开发各种查找技术中的用法，来演示架构中包含的函数的有用性。

引用次数: 0

Dealing with Overload in Distributed Stream Processing Systems 分布式流处理系统中的过载处理

22nd International Conference on Data Engineering Workshops (ICDEW'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.45

Nesime Tatbul, S. Zdonik

Overload management has been an important problem for large-scale dynamic systems. In this paper, we study this problem in the context of our Borealis distributed stream processing system. We show that server nodes must coordinate in their load shedding decisions to achieve global control on output quality. We describe a distributed load shedding approach which provides this coordination by upstream metadata aggregation and propagation. Metadata enables an upstream node to make fast local load shedding decisions which will influence its descendant nodes in the best possible way.

过载管理一直是大型动态系统的一个重要问题。本文在我们的Borealis分布式流处理系统的背景下研究了这个问题。我们表明服务器节点必须协调它们的减载决策，以实现对输出质量的全局控制。我们描述了一种分布式减载方法，该方法通过上游元数据聚合和传播提供这种协调。元数据使上游节点能够做出快速的本地减载决策，从而以最好的方式影响其后代节点。

引用次数: 34

Managing the Evolution of Dataflows with VisTrails 用细节管理数据流的演变

22nd International Conference on Data Engineering Workshops (ICDEW'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.75

Steven P. Callahan, J. Freire, E. Santos, C. Scheidegger, Cláudio T. Silva, H. Vo

Scientists are now faced with an incredible volume of data to analyze. To successfully analyze and validate various hypotheses, it is necessary to pose several queries, correlate disparate data, and create insightful visualizations of both the simulated processes and observed phenomena. Data exploration through visualization requires scientists to go through several steps. In essence, they need to assemble complex workflows that consist of dataset selection, specification of series of operations that need to be applied to the data, and the creation of appropriate visual representations, before they can finally view and analyze the results. Often, insight comes from comparing the results of multiple visualizations that are created during the data exploration process.

科学家们现在面临着海量的数据需要分析。为了成功地分析和验证各种假设，有必要提出几个查询，关联不同的数据，并创建模拟过程和观察到的现象的深刻可视化。通过可视化进行数据探索需要科学家经历几个步骤。从本质上讲，他们需要组装复杂的工作流，包括数据集选择、需要应用于数据的一系列操作的规范，以及创建适当的可视化表示，然后才能最终查看和分析结果。通常，洞察力来自于比较在数据探索过程中创建的多个可视化结果。

引用次数: 143

Searching and Ranking Documents based on Semantic Relationships 基于语义关系的文档搜索和排序

22nd International Conference on Data Engineering Workshops (ICDEW'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.131

Boanerges Aleman-Meza

Just as the link structure of the web is a critical component in today's web search, complex relationships (i.e., the different ways the dots are connected) will be an important component in tomorrow's web search technologies. In this paper, I summarize my research on answering the question of: How we can exploit semantic relationships of named-entities to improve relevance in search and ranking of documents? The intuition of my approach is to first analyze the relationships of namedentities with respect to a query. Second, relevance weights, which are assigned by human experts, can then be used to guarantee results within a relevance threshold. These relevance measures can be applied both for searching and ranking of documents.

正如网络的链接结构是当今网络搜索的关键组成部分一样，复杂的关系(即点连接的不同方式)将成为未来网络搜索技术的重要组成部分。在本文中，我总结了我在回答以下问题方面的研究:我们如何利用命名实体的语义关系来提高文档搜索和排名中的相关性?我的方法的直觉是首先分析与查询相关的命名实体的关系。其次，由人类专家分配的相关权重可以用来保证在相关阈值内的结果。这些相关性度量既可以用于文档的搜索，也可以用于文档的排序。

引用次数: 10

Unsupervised Outlier Detection in Time Series Data 时间序列数据的无监督离群点检测

22nd International Conference on Data Engineering Workshops (ICDEW'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.157

Z. Ferdousi, Akira Maeda

Fraud detection is of great importance to financial institutions. This paper is concerned with the problem of finding outliers in time series financial data using Peer Group Analysis (PGA), which is an unsupervised technique for fraud detection. The objective of PGA is to characterize the expected pattern of behavior around the target sequence in terms of the behavior of similar objects, and then to detect any difference in evolution between the expected pattern and the target. The tool has been applied to the stock market data, which has been collected from Bangladesh Stock Exchange to assess its performance in stock fraud detection. We observed PGA can detect those brokers who suddenly start selling the stock in a different way to other brokers to whom they were previously similar. We also applied t-statistics to find the deviations effectively.

欺诈检测对金融机构来说非常重要。本文研究了一种无监督的欺诈检测技术——对等群分析(Peer Group Analysis, PGA)在时间序列金融数据中发现异常值的问题。PGA的目标是根据相似对象的行为来描述目标序列周围的预期行为模式，然后检测预期模式与目标之间的进化差异。该工具已应用于股票市场数据，这些数据已从孟加拉国证券交易所收集，以评估其在股票欺诈检测方面的表现。我们观察到PGA可以检测到那些突然开始以不同的方式出售股票的经纪人，而这些经纪人之前与他们相似。我们还应用了t统计量来有效地找到偏差。

引用次数: 90

Text Mining using PrefixSpan constrained by Item Interval and Item Attribute 基于项目间隔和项目属性约束的PrefixSpan文本挖掘

22nd International Conference on Data Engineering Workshops (ICDEW'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.142

Issei Sato, Yu Hirate, H. Yamana

Applying conventional sequential pattern mining methods to text data extracts many uninteresting patterns, which increases the time to interpret the extracted patterns. To solve this problem, we propose a new sequential pattern mining algorithm by adopting the following two constraints. One is to select sequences with regard to item intervals--the number of items between any two adjacent items in a sequence--and the other is to select sequences with regard to item attributes. Using Amazon customer reviews in the book category, we have confirmed that our method is able to extract patterns faster than the conventional method, and is better able to exclude uninteresting patterns while retaining the patterns of interest.

将传统的顺序模式挖掘方法应用于文本数据中，会提取出许多不感兴趣的模式，这增加了对提取模式的解释时间。为了解决这一问题，我们提出了一种新的序列模式挖掘算法，该算法采用了以下两个约束条件。一种是根据项目间隔(序列中任意两个相邻项目之间的项目数量)选择序列，另一种是根据项目属性选择序列。通过使用图书类别中的Amazon客户评论，我们已经证实，我们的方法能够比传统方法更快地提取模式，并且能够在保留感兴趣的模式的同时更好地排除不感兴趣的模式。

引用次数: 1

MoSCoE: A Framework for Modeling Web Service Composition and Execution 建模Web服务组合和执行的框架

22nd International Conference on Data Engineering Workshops (ICDEW'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.96

Jyotishman Pathak, Samik Basu, R. Lutz, Vasant G Honavar

Development of sound approaches and software tools for specification, assembly, and deployment of composite Web services from independently developed components promises to enhance collaborative software design and reuse. In this context, the proposed research introduces a new incremental approach to service composition, MoSCoE (Modeling Web Service Composition and Execution), based on the three steps of abstraction, composition and refinement. Abstraction refers to the high-level description of the service desired (goal) by the user, which drives the identification of an appropriate composition strategy. In the event that such a composition is not realizable, MoSCoE guides the user through successive refinements of the specification towards a realizable goal service that meets the user requirements.

为独立开发的组件的组合Web服务的规范、组装和部署开发可靠的方法和软件工具，有望增强协作式软件设计和重用。在此背景下，本文提出的研究引入了一种新的服务组合增量方法，即基于抽象、组合和细化三个步骤的建模Web服务组合和执行(MoSCoE)。抽象指的是用户期望的服务(目标)的高级描述，它驱动适当组合策略的识别。如果这样的组合是不可实现的，那么MoSCoE将引导用户通过对规范的连续改进来实现满足用户需求的可实现目标服务。

引用次数: 32

Towards Privacy-Aware Location-Based Database Servers 面向隐私感知的基于位置的数据库服务器

22nd International Conference on Data Engineering Workshops (ICDEW'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.152

M. Mokbel

The wide spread of location-based services results in a strong market for location-detection devices (e.g., GPS-like devices, RFIDs, handheld devices, and cellular phones). Examples of location-based services include location-aware emergency service, location-based advertisement, live traffic reports, and location-based store finder. However, location-detection devices pose a major privacy threat on its users where it transmits private information (i.e., the location) to the server who may be untrustworthy. The existing model of location-based applications trades service with privacy where if a user wants to keep her private location information, she has to turn off her location-detection device, i.e., unsubscribe from the service. This paper tackles this model in a way that protects the user privacy while keeping the functionality of location-based services. The main idea is to employ a trusted third party, the Location Anonymizer, that expands the user location into a spatial region such that: (1) The exact user location can lie anywhere in the spatial region, and (2) There are k other users within the expanded spatial region so that each user is k-anonymous. The location-based database server is equipped with additional functionalities that support spatio-temporal queries based on the spatial region received from the location anonymizer rather than the exact point location received from the user.

基于位置的服务的广泛传播导致了位置检测设备(例如，类似gps的设备、rfid、手持设备和蜂窝电话)的强大市场。基于位置的服务的示例包括位置感知紧急服务、基于位置的广告、实时流量报告和基于位置的商店查找器。然而，位置检测设备对其用户构成了主要的隐私威胁，因为它将私人信息(即位置)传输到可能不值得信任的服务器。现有的基于位置的应用程序模型将服务与隐私交换，如果用户想要保留自己的私密位置信息，就必须关闭位置检测设备，也就是说，取消订阅服务。本文以一种既保护用户隐私又保持基于位置的服务功能的方式来解决这个模型。主要思想是使用一个可信的第三方，即位置匿名器，它将用户位置扩展到一个空间区域，这样:(1)确切的用户位置可以位于空间区域的任何位置;(2)在扩展的空间区域中有k个其他用户，因此每个用户都是k匿名的。基于位置的数据库服务器配备了额外的功能，这些功能支持基于从位置匿名器接收到的空间区域而不是从用户接收到的确切点位置进行时空查询。

{"title":"Towards Privacy-Aware Location-Based Database Servers","authors":"M. Mokbel","doi":"10.1109/ICDEW.2006.152","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.152","url":null,"abstract":"The wide spread of location-based services results in a strong market for location-detection devices (e.g., GPS-like devices, RFIDs, handheld devices, and cellular phones). Examples of location-based services include location-aware emergency service, location-based advertisement, live traffic reports, and location-based store finder. However, location-detection devices pose a major privacy threat on its users where it transmits private information (i.e., the location) to the server who may be untrustworthy. The existing model of location-based applications trades service with privacy where if a user wants to keep her private location information, she has to turn off her location-detection device, i.e., unsubscribe from the service. This paper tackles this model in a way that protects the user privacy while keeping the functionality of location-based services. The main idea is to employ a trusted third party, the Location Anonymizer, that expands the user location into a spatial region such that: (1) The exact user location can lie anywhere in the spatial region, and (2) There are k other users within the expanded spatial region so that each user is k-anonymous. The location-based database server is equipped with additional functionalities that support spatio-temporal queries based on the spatial region received from the location anonymizer rather than the exact point location received from the user.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125082480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 87

Twig Query Processing Under Concurrent Updates 并行更新下的小枝查询处理

22nd International Conference on Data Engineering Workshops (ICDEW'06)

Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.156

Christian Mathis, T. Härder

An appropriate database language characteristics leading to the success of declarative query processing - and, in turn, to the rise of relational DBMSs in general - always provides more than one way of evaluating a query. This counts for structurally different but logically equivalent query evaluation plans (QEPs) as well as for different implementations of the same logical operator. This principle surely holds for the novel XML database management systems (XDBMSs): Recently proposed operators for XML query processing can be grouped into the logical operators Structural Join [1, 22] and Holistic Twig Join [3, 6, 16]. Depending on available internal system mechanisms, a lot of opportunities exist how to implement these operators (two of which are presented in this paper.

适当的数据库语言特征会导致声明性查询处理的成功——进而导致关系型dbms的兴起——总是提供不止一种计算查询的方法。这对于结构不同但逻辑等效的查询计算计划(qep)以及相同逻辑运算符的不同实现都很重要。这一原则确实适用于新的XML数据库管理系统(xdbms):最近提出的用于XML查询处理的操作符可以分为逻辑操作符Structural Join[1,22]和Holistic Twig Join[3,6,16]。根据可用的内部系统机制，存在许多实现这些操作符的机会(本文给出了其中两个)。

引用次数: 3

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

22nd International Conference on Data Engineering Workshops (ICDEW'06)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀