首页 > 最新文献

22nd International Conference on Data Engineering Workshops (ICDEW'06)最新文献

英文 中文
A Peer-to-Peer Architecture to Enable Versatile Lookup System Design 实现多用途查找系统设计的点对点架构
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.17
Vivek Sawant, J. Kaur
The resource lookup requirements in applications such as web caching, web content search, content distribution, resource sharing, network monitoring and management, and e-commerce have caught the attention of peer-to-peer (P2P) distributed systems researchers. Over the past few years, several decentralized P2P lookup system designs have been proposed for addressing these requirements. Most of these early designs are targeted at specific applications. Unfortunately, the variations in the operating environments and lookup characteristics across applications restricts the applicability of such specialized designs. In this paper, we present an architecture for P2P systems that identifies the functions necessary for designing resource lookup systems with wide applicability. We demonstrate the usefulness of the functions included in the architecture by illustrating their use in developing diverse lookup techniques.
web缓存、web内容搜索、内容分发、资源共享、网络监控和管理以及电子商务等应用中的资源查找需求引起了点对点分布式系统研究者的关注。在过去的几年中,为了满足这些需求,已经提出了几种分散的P2P查找系统设计。这些早期的设计大多针对特定的应用。不幸的是,操作环境和跨应用程序查找特性的变化限制了这种专门设计的适用性。在本文中,我们提出了一个P2P系统的体系结构,该体系结构确定了设计具有广泛适用性的资源查找系统所需的功能。我们通过说明在开发各种查找技术中的用法,来演示架构中包含的函数的有用性。
{"title":"A Peer-to-Peer Architecture to Enable Versatile Lookup System Design","authors":"Vivek Sawant, J. Kaur","doi":"10.1109/ICDEW.2006.17","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.17","url":null,"abstract":"The resource lookup requirements in applications such as web caching, web content search, content distribution, resource sharing, network monitoring and management, and e-commerce have caught the attention of peer-to-peer (P2P) distributed systems researchers. Over the past few years, several decentralized P2P lookup system designs have been proposed for addressing these requirements. Most of these early designs are targeted at specific applications. Unfortunately, the variations in the operating environments and lookup characteristics across applications restricts the applicability of such specialized designs. In this paper, we present an architecture for P2P systems that identifies the functions necessary for designing resource lookup systems with wide applicability. We demonstrate the usefulness of the functions included in the architecture by illustrating their use in developing diverse lookup techniques.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115208294","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New Functions of File Systems to Manage Information Shared by Communities 文件系统管理社区共享信息的新功能
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.98
Ken'ichi Ishikawa, Atsuyuki Morishima, S. Sugimoto
Today, more and more people in knowledge communities, like research laboratories, use shared file servers to store and share their information. People in such communities often work together and their files stored in a file server have relationships with each other. Information on the relationships is usually exchanged offline and used implicitly to facilitate the management and sharing of the files. This paper proposes new functions to manage and use the relationships to make various views on the file servers. The functions provide a high-level support and are compatible with the operational framework of existing file systems.
今天,越来越多的人在知识社区,如研究实验室,使用共享文件服务器来存储和共享他们的信息。这些社区中的人们经常一起工作,他们存储在文件服务器中的文件彼此之间存在关系。有关关系的信息通常离线交换,并隐式地用于促进文件的管理和共享。本文提出了新的功能来管理和使用文件服务器上的各种视图的关系。这些功能提供了高级支持,并与现有文件系统的操作框架兼容。
{"title":"New Functions of File Systems to Manage Information Shared by Communities","authors":"Ken'ichi Ishikawa, Atsuyuki Morishima, S. Sugimoto","doi":"10.1109/ICDEW.2006.98","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.98","url":null,"abstract":"Today, more and more people in knowledge communities, like research laboratories, use shared file servers to store and share their information. People in such communities often work together and their files stored in a file server have relationships with each other. Information on the relationships is usually exchanged offline and used implicitly to facilitate the management and sharing of the files. This paper proposes new functions to manage and use the relationships to make various views on the file servers. The functions provide a high-level support and are compatible with the operational framework of existing file systems.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115897431","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Managing the Evolution of Dataflows with VisTrails 用细节管理数据流的演变
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.75
Steven P. Callahan, J. Freire, E. Santos, C. Scheidegger, Cláudio T. Silva, H. Vo
Scientists are now faced with an incredible volume of data to analyze. To successfully analyze and validate various hypotheses, it is necessary to pose several queries, correlate disparate data, and create insightful visualizations of both the simulated processes and observed phenomena. Data exploration through visualization requires scientists to go through several steps. In essence, they need to assemble complex workflows that consist of dataset selection, specification of series of operations that need to be applied to the data, and the creation of appropriate visual representations, before they can finally view and analyze the results. Often, insight comes from comparing the results of multiple visualizations that are created during the data exploration process.
科学家们现在面临着海量的数据需要分析。为了成功地分析和验证各种假设,有必要提出几个查询,关联不同的数据,并创建模拟过程和观察到的现象的深刻可视化。通过可视化进行数据探索需要科学家经历几个步骤。从本质上讲,他们需要组装复杂的工作流,包括数据集选择、需要应用于数据的一系列操作的规范,以及创建适当的可视化表示,然后才能最终查看和分析结果。通常,洞察力来自于比较在数据探索过程中创建的多个可视化结果。
{"title":"Managing the Evolution of Dataflows with VisTrails","authors":"Steven P. Callahan, J. Freire, E. Santos, C. Scheidegger, Cláudio T. Silva, H. Vo","doi":"10.1109/ICDEW.2006.75","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.75","url":null,"abstract":"Scientists are now faced with an incredible volume of data to analyze. To successfully analyze and validate various hypotheses, it is necessary to pose several queries, correlate disparate data, and create insightful visualizations of both the simulated processes and observed phenomena. Data exploration through visualization requires scientists to go through several steps. In essence, they need to assemble complex workflows that consist of dataset selection, specification of series of operations that need to be applied to the data, and the creation of appropriate visual representations, before they can finally view and analyze the results. Often, insight comes from comparing the results of multiple visualizations that are created during the data exploration process.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116254380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 143
Searching and Ranking Documents based on Semantic Relationships 基于语义关系的文档搜索和排序
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.131
Boanerges Aleman-Meza
Just as the link structure of the web is a critical component in today's web search, complex relationships (i.e., the different ways the dots are connected) will be an important component in tomorrow's web search technologies. In this paper, I summarize my research on answering the question of: How we can exploit semantic relationships of named-entities to improve relevance in search and ranking of documents? The intuition of my approach is to first analyze the relationships of namedentities with respect to a query. Second, relevance weights, which are assigned by human experts, can then be used to guarantee results within a relevance threshold. These relevance measures can be applied both for searching and ranking of documents.
正如网络的链接结构是当今网络搜索的关键组成部分一样,复杂的关系(即点连接的不同方式)将成为未来网络搜索技术的重要组成部分。在本文中,我总结了我在回答以下问题方面的研究:我们如何利用命名实体的语义关系来提高文档搜索和排名中的相关性?我的方法的直觉是首先分析与查询相关的命名实体的关系。其次,由人类专家分配的相关权重可以用来保证在相关阈值内的结果。这些相关性度量既可以用于文档的搜索,也可以用于文档的排序。
{"title":"Searching and Ranking Documents based on Semantic Relationships","authors":"Boanerges Aleman-Meza","doi":"10.1109/ICDEW.2006.131","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.131","url":null,"abstract":"Just as the link structure of the web is a critical component in today's web search, complex relationships (i.e., the different ways the dots are connected) will be an important component in tomorrow's web search technologies. In this paper, I summarize my research on answering the question of: How we can exploit semantic relationships of named-entities to improve relevance in search and ranking of documents? The intuition of my approach is to first analyze the relationships of namedentities with respect to a query. Second, relevance weights, which are assigned by human experts, can then be used to guarantee results within a relevance threshold. These relevance measures can be applied both for searching and ranking of documents.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116293418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Text Mining using PrefixSpan constrained by Item Interval and Item Attribute 基于项目间隔和项目属性约束的PrefixSpan文本挖掘
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.142
Issei Sato, Yu Hirate, H. Yamana
Applying conventional sequential pattern mining methods to text data extracts many uninteresting patterns, which increases the time to interpret the extracted patterns. To solve this problem, we propose a new sequential pattern mining algorithm by adopting the following two constraints. One is to select sequences with regard to item intervals--the number of items between any two adjacent items in a sequence--and the other is to select sequences with regard to item attributes. Using Amazon customer reviews in the book category, we have confirmed that our method is able to extract patterns faster than the conventional method, and is better able to exclude uninteresting patterns while retaining the patterns of interest.
将传统的顺序模式挖掘方法应用于文本数据中,会提取出许多不感兴趣的模式,这增加了对提取模式的解释时间。为了解决这一问题,我们提出了一种新的序列模式挖掘算法,该算法采用了以下两个约束条件。一种是根据项目间隔(序列中任意两个相邻项目之间的项目数量)选择序列,另一种是根据项目属性选择序列。通过使用图书类别中的Amazon客户评论,我们已经证实,我们的方法能够比传统方法更快地提取模式,并且能够在保留感兴趣的模式的同时更好地排除不感兴趣的模式。
{"title":"Text Mining using PrefixSpan constrained by Item Interval and Item Attribute","authors":"Issei Sato, Yu Hirate, H. Yamana","doi":"10.1109/ICDEW.2006.142","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.142","url":null,"abstract":"Applying conventional sequential pattern mining methods to text data extracts many uninteresting patterns, which increases the time to interpret the extracted patterns. To solve this problem, we propose a new sequential pattern mining algorithm by adopting the following two constraints. One is to select sequences with regard to item intervals--the number of items between any two adjacent items in a sequence--and the other is to select sequences with regard to item attributes. Using Amazon customer reviews in the book category, we have confirmed that our method is able to extract patterns faster than the conventional method, and is better able to exclude uninteresting patterns while retaining the patterns of interest.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122877670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Towards Privacy-Aware Location-Based Database Servers 面向隐私感知的基于位置的数据库服务器
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.152
M. Mokbel
The wide spread of location-based services results in a strong market for location-detection devices (e.g., GPS-like devices, RFIDs, handheld devices, and cellular phones). Examples of location-based services include location-aware emergency service, location-based advertisement, live traffic reports, and location-based store finder. However, location-detection devices pose a major privacy threat on its users where it transmits private information (i.e., the location) to the server who may be untrustworthy. The existing model of location-based applications trades service with privacy where if a user wants to keep her private location information, she has to turn off her location-detection device, i.e., unsubscribe from the service. This paper tackles this model in a way that protects the user privacy while keeping the functionality of location-based services. The main idea is to employ a trusted third party, the Location Anonymizer, that expands the user location into a spatial region such that: (1) The exact user location can lie anywhere in the spatial region, and (2) There are k other users within the expanded spatial region so that each user is k-anonymous. The location-based database server is equipped with additional functionalities that support spatio-temporal queries based on the spatial region received from the location anonymizer rather than the exact point location received from the user.
基于位置的服务的广泛传播导致了位置检测设备(例如,类似gps的设备、rfid、手持设备和蜂窝电话)的强大市场。基于位置的服务的示例包括位置感知紧急服务、基于位置的广告、实时流量报告和基于位置的商店查找器。然而,位置检测设备对其用户构成了主要的隐私威胁,因为它将私人信息(即位置)传输到可能不值得信任的服务器。现有的基于位置的应用程序模型将服务与隐私交换,如果用户想要保留自己的私密位置信息,就必须关闭位置检测设备,也就是说,取消订阅服务。本文以一种既保护用户隐私又保持基于位置的服务功能的方式来解决这个模型。主要思想是使用一个可信的第三方,即位置匿名器,它将用户位置扩展到一个空间区域,这样:(1)确切的用户位置可以位于空间区域的任何位置;(2)在扩展的空间区域中有k个其他用户,因此每个用户都是k匿名的。基于位置的数据库服务器配备了额外的功能,这些功能支持基于从位置匿名器接收到的空间区域而不是从用户接收到的确切点位置进行时空查询。
{"title":"Towards Privacy-Aware Location-Based Database Servers","authors":"M. Mokbel","doi":"10.1109/ICDEW.2006.152","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.152","url":null,"abstract":"The wide spread of location-based services results in a strong market for location-detection devices (e.g., GPS-like devices, RFIDs, handheld devices, and cellular phones). Examples of location-based services include location-aware emergency service, location-based advertisement, live traffic reports, and location-based store finder. However, location-detection devices pose a major privacy threat on its users where it transmits private information (i.e., the location) to the server who may be untrustworthy. The existing model of location-based applications trades service with privacy where if a user wants to keep her private location information, she has to turn off her location-detection device, i.e., unsubscribe from the service. This paper tackles this model in a way that protects the user privacy while keeping the functionality of location-based services. The main idea is to employ a trusted third party, the Location Anonymizer, that expands the user location into a spatial region such that: (1) The exact user location can lie anywhere in the spatial region, and (2) There are k other users within the expanded spatial region so that each user is k-anonymous. The location-based database server is equipped with additional functionalities that support spatio-temporal queries based on the spatial region received from the location anonymizer rather than the exact point location received from the user.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125082480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 87
Mining Spatial and Spatio-Temporal Patterns in Scientific Data 科学数据中的时空模式挖掘
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.92
Hui Yang, S. Parthasarathy
Data mining is the process of discovering hidden and meaningful knowledge in a data set. It has been successfully applied to many real-life problems, for instance, web personalization, network intrusion detection, and customized marketing. Recent advances in computational sciences have led to the application of data mining to various scientific domains, such as astronomy and bioinformatics, to facilitate the understanding of different scientific processes in the underlying domain. In this thesis work, we focus on designing and applying data mining techniques to analyze spatial and spatiotemporal data originated in scientific domains. Examples of spatial and spatio-temporal data in scientific domains include data describing protein structures and data produced from protein folding simulations, respectively. Specifically, we have proposed a generalized framework to effectively discover different types of spatial and spatio-temporal patterns in scientific data sets. Such patterns can be used to capture a variety of interactions among objects of interest and the evolutionary behavior of such interactions. We have applied the framework to analyze data originated in the following three application domains: bioinformatics, computational molecular dynamics, and computational fluid dynamics. Empirical results demonstrate that the discovered patterns are meaningful in the underlying domain and can provide important insights into various scientific phenomena.
数据挖掘是在数据集中发现隐藏的和有意义的知识的过程。它已经成功地应用于许多现实问题,例如,web个性化、网络入侵检测和定制营销。计算科学的最新进展导致数据挖掘应用于各种科学领域,如天文学和生物信息学,以促进对基础领域中不同科学过程的理解。在本论文中,我们着重于设计和应用数据挖掘技术来分析源自科学领域的时空数据。空间和时空数据在科学领域的例子分别包括描述蛋白质结构的数据和由蛋白质折叠模拟产生的数据。具体而言,我们提出了一个通用框架,以有效地发现科学数据集中不同类型的空间和时空模式。这种模式可用于捕获感兴趣的对象之间的各种交互以及这种交互的演化行为。我们已经应用该框架来分析来自以下三个应用领域的数据:生物信息学、计算分子动力学和计算流体动力学。实证结果表明,发现的模式在基础领域是有意义的,可以为各种科学现象提供重要的见解。
{"title":"Mining Spatial and Spatio-Temporal Patterns in Scientific Data","authors":"Hui Yang, S. Parthasarathy","doi":"10.1109/ICDEW.2006.92","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.92","url":null,"abstract":"Data mining is the process of discovering hidden and meaningful knowledge in a data set. It has been successfully applied to many real-life problems, for instance, web personalization, network intrusion detection, and customized marketing. Recent advances in computational sciences have led to the application of data mining to various scientific domains, such as astronomy and bioinformatics, to facilitate the understanding of different scientific processes in the underlying domain. In this thesis work, we focus on designing and applying data mining techniques to analyze spatial and spatiotemporal data originated in scientific domains. Examples of spatial and spatio-temporal data in scientific domains include data describing protein structures and data produced from protein folding simulations, respectively. Specifically, we have proposed a generalized framework to effectively discover different types of spatial and spatio-temporal patterns in scientific data sets. Such patterns can be used to capture a variety of interactions among objects of interest and the evolutionary behavior of such interactions. We have applied the framework to analyze data originated in the following three application domains: bioinformatics, computational molecular dynamics, and computational fluid dynamics. Empirical results demonstrate that the discovered patterns are meaningful in the underlying domain and can provide important insights into various scientific phenomena.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128624882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Twig Query Processing Under Concurrent Updates 并行更新下的小枝查询处理
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.156
Christian Mathis, T. Härder
An appropriate database language characteristics leading to the success of declarative query processing - and, in turn, to the rise of relational DBMSs in general - always provides more than one way of evaluating a query. This counts for structurally different but logically equivalent query evaluation plans (QEPs) as well as for different implementations of the same logical operator. This principle surely holds for the novel XML database management systems (XDBMSs): Recently proposed operators for XML query processing can be grouped into the logical operators Structural Join [1, 22] and Holistic Twig Join [3, 6, 16]. Depending on available internal system mechanisms, a lot of opportunities exist how to implement these operators (two of which are presented in this paper.
适当的数据库语言特征会导致声明性查询处理的成功——进而导致关系型dbms的兴起——总是提供不止一种计算查询的方法。这对于结构不同但逻辑等效的查询计算计划(qep)以及相同逻辑运算符的不同实现都很重要。这一原则确实适用于新的XML数据库管理系统(xdbms):最近提出的用于XML查询处理的操作符可以分为逻辑操作符Structural Join[1,22]和Holistic Twig Join[3,6,16]。根据可用的内部系统机制,存在许多实现这些操作符的机会(本文给出了其中两个)。
{"title":"Twig Query Processing Under Concurrent Updates","authors":"Christian Mathis, T. Härder","doi":"10.1109/ICDEW.2006.156","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.156","url":null,"abstract":"An appropriate database language characteristics leading to the success of declarative query processing - and, in turn, to the rise of relational DBMSs in general - always provides more than one way of evaluating a query. This counts for structurally different but logically equivalent query evaluation plans (QEPs) as well as for different implementations of the same logical operator. This principle surely holds for the novel XML database management systems (XDBMSs): Recently proposed operators for XML query processing can be grouped into the logical operators Structural Join [1, 22] and Holistic Twig Join [3, 6, 16]. Depending on available internal system mechanisms, a lot of opportunities exist how to implement these operators (two of which are presented in this paper.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127241066","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Dealing with Overload in Distributed Stream Processing Systems 分布式流处理系统中的过载处理
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.45
Nesime Tatbul, S. Zdonik
Overload management has been an important problem for large-scale dynamic systems. In this paper, we study this problem in the context of our Borealis distributed stream processing system. We show that server nodes must coordinate in their load shedding decisions to achieve global control on output quality. We describe a distributed load shedding approach which provides this coordination by upstream metadata aggregation and propagation. Metadata enables an upstream node to make fast local load shedding decisions which will influence its descendant nodes in the best possible way.
过载管理一直是大型动态系统的一个重要问题。本文在我们的Borealis分布式流处理系统的背景下研究了这个问题。我们表明服务器节点必须协调它们的减载决策,以实现对输出质量的全局控制。我们描述了一种分布式减载方法,该方法通过上游元数据聚合和传播提供这种协调。元数据使上游节点能够做出快速的本地减载决策,从而以最好的方式影响其后代节点。
{"title":"Dealing with Overload in Distributed Stream Processing Systems","authors":"Nesime Tatbul, S. Zdonik","doi":"10.1109/ICDEW.2006.45","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.45","url":null,"abstract":"Overload management has been an important problem for large-scale dynamic systems. In this paper, we study this problem in the context of our Borealis distributed stream processing system. We show that server nodes must coordinate in their load shedding decisions to achieve global control on output quality. We describe a distributed load shedding approach which provides this coordination by upstream metadata aggregation and propagation. Metadata enables an upstream node to make fast local load shedding decisions which will influence its descendant nodes in the best possible way.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114784442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Integration Workbench: Integrating Schema Integration Tools 集成工作台:集成模式集成工具
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.69
P. Mork, A. Rosenthal, Leonard J. Seligman, Joel Korb, Ken Samuel
A key aspect of any data integration endeavor is establishing a transformation that translates instances of one or more source schemata into instances of a target schema. This schema integration task must be tackled regardless of the integration architecture or mapping formalism. In this paper we provide a task model for schema integration. We use this breakdown to motivate a workbench for schema integration in which multiple tools share a common knowledge repository. In particular, the workbench facilitates the interoperation of research prototypes for schema matching (which automatically identify likely semantic correspondences) with commercial schema mapping tools (which help produce instance-level transformations). Currently, each of these tools provides its own ad hoc representation of schemata and mappings; combining these tools requires aligning these representations. The workbench provides a common representation so that these tools can more rapidly be combined.
任何数据集成工作的一个关键方面是建立转换,将一个或多个源模式的实例转换为目标模式的实例。无论采用何种集成体系结构或映射形式,都必须处理此模式集成任务。本文提出了一种模式集成的任务模型。我们使用这个分解来激发一个用于模式集成的工作台,其中多个工具共享一个公共知识存储库。特别是,工作台促进了模式匹配(自动识别可能的语义对应)与商业模式映射工具(帮助生成实例级转换)的研究原型的互操作。目前,这些工具中的每一个都提供了自己的模式和映射的特别表示;组合这些工具需要对齐这些表示。工作台提供了一个通用的表示,以便这些工具可以更快地组合在一起。
{"title":"Integration Workbench: Integrating Schema Integration Tools","authors":"P. Mork, A. Rosenthal, Leonard J. Seligman, Joel Korb, Ken Samuel","doi":"10.1109/ICDEW.2006.69","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.69","url":null,"abstract":"A key aspect of any data integration endeavor is establishing a transformation that translates instances of one or more source schemata into instances of a target schema. This schema integration task must be tackled regardless of the integration architecture or mapping formalism. In this paper we provide a task model for schema integration. We use this breakdown to motivate a workbench for schema integration in which multiple tools share a common knowledge repository. In particular, the workbench facilitates the interoperation of research prototypes for schema matching (which automatically identify likely semantic correspondences) with commercial schema mapping tools (which help produce instance-level transformations). Currently, each of these tools provides its own ad hoc representation of schemata and mappings; combining these tools requires aligning these representations. The workbench provides a common representation so that these tools can more rapidly be combined.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129312795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
期刊
22nd International Conference on Data Engineering Workshops (ICDEW'06)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1