首页 > 最新文献

22nd International Conference on Data Engineering Workshops (ICDEW'06)最新文献

英文 中文
PRIVATE-IYE: A Framework for Privacy Preserving Data Integration 私有- iye:一种保护隐私的数据集成框架
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.117
S. Bhowmick, L. Gruenwald, M. Iwaihara, S. Chatvichienchai
Data integration has been a long standing challenge to the database and data mining communities. This need has become critical in numerous contexts, including building e-commerce market places, sharing data from scientific research, and improving homeland security. However, these important activities are hampered by legitimate and widespread concerns of data privacy. It is necessary to develop solutions that enable integration of data, especially in the domains of national priorities, while effective privacy control of the data. In this paper, we present an architecture and key research issues for building such a privacy preserving data integration system called PRIVATE-IYE.
数据集成一直是数据库和数据挖掘社区面临的一个长期挑战。这一需求在许多情况下都变得至关重要,包括建立电子商务市场、共享科学研究数据和改善国土安全。然而,这些重要的活动受到合法和广泛的数据隐私担忧的阻碍。有必要制定解决方案,使数据能够整合,特别是在国家优先领域,同时对数据进行有效的隐私控制。本文提出了一种隐私保护数据集成系统PRIVATE-IYE的体系结构和关键研究问题。
{"title":"PRIVATE-IYE: A Framework for Privacy Preserving Data Integration","authors":"S. Bhowmick, L. Gruenwald, M. Iwaihara, S. Chatvichienchai","doi":"10.1109/ICDEW.2006.117","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.117","url":null,"abstract":"Data integration has been a long standing challenge to the database and data mining communities. This need has become critical in numerous contexts, including building e-commerce market places, sharing data from scientific research, and improving homeland security. However, these important activities are hampered by legitimate and widespread concerns of data privacy. It is necessary to develop solutions that enable integration of data, especially in the domains of national priorities, while effective privacy control of the data. In this paper, we present an architecture and key research issues for building such a privacy preserving data integration system called PRIVATE-IYE.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116056489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 37
Model Video Semantics with Constraints Considering Temporal Structure and Typed Events 考虑时间结构和类型事件约束的视频语义模型
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.94
Yu Wang, Lizhu Zhou, Jianyong Wang
The advances of video technology and video-related applications demand appropriate video semantic models for representing video data and their semantics, and supporting powerful semantic queries on them. In this paper, we propose such a model named SemTTE. The model incorporates features of temporal structure and typed events of video contents. It organizes the whole video into a tree of events, and provides mechanisms for users to define domain-specific constraints. As a result, the contents and semantics of the video can be better represented and queried. For constraints enforcement, an efficient on-line method is proposed.
视频技术和视频相关应用的发展需要合适的视频语义模型来表示视频数据及其语义,并支持对其进行强大的语义查询。在本文中,我们提出了一个名为SemTTE的模型。该模型结合了视频内容的时间结构特征和事件类型特征。它将整个视频组织成事件树,并为用户提供定义特定领域约束的机制。这样可以更好地表示和查询视频的内容和语义。对于约束的执行,提出了一种有效的在线执行方法。
{"title":"Model Video Semantics with Constraints Considering Temporal Structure and Typed Events","authors":"Yu Wang, Lizhu Zhou, Jianyong Wang","doi":"10.1109/ICDEW.2006.94","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.94","url":null,"abstract":"The advances of video technology and video-related applications demand appropriate video semantic models for representing video data and their semantics, and supporting powerful semantic queries on them. In this paper, we propose such a model named SemTTE. The model incorporates features of temporal structure and typed events of video contents. It organizes the whole video into a tree of events, and provides mechanisms for users to define domain-specific constraints. As a result, the contents and semantics of the video can be better represented and queried. For constraints enforcement, an efficient on-line method is proposed.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116405484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Trust Negotiation as an Authorization Service forWeb Services 作为web服务授权服务的信任协商
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.154
L.E. Olson, M. Winslett, G. Tonti, Nathan Seeley, Andrzej Uszok, J. Bradshaw
Like other open computing environments, web services need a scalable method of determining authorized users. We present desiderata for authorization facilities for web services, and analyze potential ways of satisfying them. We propose a third-party authorization system for web services based on trust negotiation, discuss its implementation using the TrustBuilder runtime system for trust negotiation, and present performance results from a stock trading application.
与其他开放计算环境一样,web服务需要一种可扩展的方法来确定授权用户。我们提出了对web服务授权工具的需求,并分析了满足这些需求的潜在方法。提出了一种基于信任协商的web服务第三方授权系统,讨论了基于信任协商的TrustBuilder运行时系统的实现,并给出了一个股票交易应用的性能结果。
{"title":"Trust Negotiation as an Authorization Service forWeb Services","authors":"L.E. Olson, M. Winslett, G. Tonti, Nathan Seeley, Andrzej Uszok, J. Bradshaw","doi":"10.1109/ICDEW.2006.154","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.154","url":null,"abstract":"Like other open computing environments, web services need a scalable method of determining authorized users. We present desiderata for authorization facilities for web services, and analyze potential ways of satisfying them. We propose a third-party authorization system for web services based on trust negotiation, discuss its implementation using the TrustBuilder runtime system for trust negotiation, and present performance results from a stock trading application.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123008262","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Semi-Supervised Clustering of XML Documents: Getting the Most from Structural Information XML文档的半监督聚类:从结构信息中获取最多
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.136
Eduardo Goncalves da Silva, M. Mattoso, G. Xexéo
As document providers can express more contextualized and complex information, semi-structured documents are becoming a major source of information in many areas, e.g., in digital libraries, e-commerce or Web applications. A particular characteristic of such document collections is the existence of some structure or metadata along with the data. In this scenario, clustering methods that can take advantage of such structural information to better organize such collections are highly relevant. Semi-structured documents pose new challenges to document clustering methods, however, since it is not clear how this structural information can be used to improve the quality of the generated clustering models. On the other hand, recently there has a growing interest in the semi-supervised clustering task, in which a little amount of prior knowledge is provided to guide the algorithm to a better clustering model. A particular type of semi-supervision is in the form of user-provided constraints defined over pairs of objects, where each pair informs if its objects must be in the same or in different clusters. In this paper, we consider the problem of constrained clustering in documents that present some form of structural information. We consider the existence of a particular form of information to be clustered: textual documents that present a logical structure represented in XML format. We define and extend methods to improve the quality of clustering results by using such structural information to guide the execution of the constrained clustering algorithm. Experimental results on the OHSUMED document collection show the effectiveness of our approach.
由于文档提供者可以表达更多的上下文化和复杂的信息,半结构化文档正在成为许多领域的主要信息来源,例如,在数字图书馆、电子商务或Web应用程序中。这种文档集合的一个特殊特征是在数据中存在某种结构或元数据。在这种情况下,能够利用这种结构信息来更好地组织这种集合的聚类方法是高度相关的。然而,半结构化文档对文档聚类方法提出了新的挑战,因为尚不清楚如何使用这些结构化信息来提高生成的聚类模型的质量。另一方面,近年来人们对半监督聚类任务越来越感兴趣,在半监督聚类任务中,提供少量的先验知识来指导算法获得更好的聚类模型。一种特殊类型的半监督是以用户提供的约束的形式定义的对象对,其中每对对象通知它的对象是否必须在相同或不同的集群中。在本文中,我们考虑了存在某种形式的结构信息的文档中的约束聚类问题。我们认为存在一种需要聚集的特殊形式的信息:以XML格式表示逻辑结构的文本文档。我们定义并扩展了一些方法,通过使用这些结构信息来指导约束聚类算法的执行来提高聚类结果的质量。OHSUMED文档集的实验结果表明了该方法的有效性。
{"title":"Semi-Supervised Clustering of XML Documents: Getting the Most from Structural Information","authors":"Eduardo Goncalves da Silva, M. Mattoso, G. Xexéo","doi":"10.1109/ICDEW.2006.136","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.136","url":null,"abstract":"As document providers can express more contextualized and complex information, semi-structured documents are becoming a major source of information in many areas, e.g., in digital libraries, e-commerce or Web applications. A particular characteristic of such document collections is the existence of some structure or metadata along with the data. In this scenario, clustering methods that can take advantage of such structural information to better organize such collections are highly relevant. Semi-structured documents pose new challenges to document clustering methods, however, since it is not clear how this structural information can be used to improve the quality of the generated clustering models. On the other hand, recently there has a growing interest in the semi-supervised clustering task, in which a little amount of prior knowledge is provided to guide the algorithm to a better clustering model. A particular type of semi-supervision is in the form of user-provided constraints defined over pairs of objects, where each pair informs if its objects must be in the same or in different clusters. In this paper, we consider the problem of constrained clustering in documents that present some form of structural information. We consider the existence of a particular form of information to be clustered: textual documents that present a logical structure represented in XML format. We define and extend methods to improve the quality of clustering results by using such structural information to guide the execution of the constrained clustering algorithm. Experimental results on the OHSUMED document collection show the effectiveness of our approach.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129916546","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Video Database Modeling and Temporal Pattern Retrieval using Hierarchical Markov Model Mediator 基于层次马尔可夫模型的视频数据库建模与时间模式检索
Pub Date : 2006-04-03 DOI: 10.1109/icdew.2006.162
Na Zhao, Shu‐Ching Chen, M. Shyu
The dream of pervasive multimedia retrieval and reuse will not be realized without incorporating semantics in the multimedia database. As video data is penetrating many information systems, the need for database support for video data evolves. Hence, we propose an innovative database modeling mechanism called Hierarchical Markov Model Mediator (HMMM) which integrates lowlevel features, semantic concepts, and high-level user perceptions for modeling and indexing multiple-level video objects to facilitate temporal pattern retrieval. Different from the existing database modeling methods, our approach carries a stochastic and dynamic process in both search and similarity calculation. In the retrieval of semantic event patterns, HMMM always tries to traverse the right path and therefore it can assist in retrieving more accurate patterns quickly with lower computational costs. Moreover, HMMM supports feedbacks and learning strategies, which can proficiently assure the continuous improvements of the overall performance.
如果不在多媒体数据库中加入语义,普及多媒体检索和重用的梦想就无法实现。随着视频数据渗透到许多信息系统中,对视频数据数据库支持的需求也在不断发展。因此,我们提出了一种创新的数据库建模机制,称为层次马尔可夫模型中介(hmm),它集成了低级特征、语义概念和高级用户感知,用于建模和索引多层视频对象,以促进时间模式检索。与现有的数据库建模方法不同,我们的方法在搜索和相似度计算中都带有随机和动态的过程。在语义事件模式的检索中,hmm总是尝试遍历正确的路径,因此它可以帮助以更低的计算成本快速检索更准确的模式。此外,hmm支持反馈和学习策略,可以熟练地保证整体绩效的持续改进。
{"title":"Video Database Modeling and Temporal Pattern Retrieval using Hierarchical Markov Model Mediator","authors":"Na Zhao, Shu‐Ching Chen, M. Shyu","doi":"10.1109/icdew.2006.162","DOIUrl":"https://doi.org/10.1109/icdew.2006.162","url":null,"abstract":"The dream of pervasive multimedia retrieval and reuse will not be realized without incorporating semantics in the multimedia database. As video data is penetrating many information systems, the need for database support for video data evolves. Hence, we propose an innovative database modeling mechanism called Hierarchical Markov Model Mediator (HMMM) which integrates lowlevel features, semantic concepts, and high-level user perceptions for modeling and indexing multiple-level video objects to facilitate temporal pattern retrieval. Different from the existing database modeling methods, our approach carries a stochastic and dynamic process in both search and similarity calculation. In the retrieval of semantic event patterns, HMMM always tries to traverse the right path and therefore it can assist in retrieving more accurate patterns quickly with lower computational costs. Moreover, HMMM supports feedbacks and learning strategies, which can proficiently assure the continuous improvements of the overall performance.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130606069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Integrating Databases into the Semantic Web through an Ontology-Based Framework 通过基于本体的框架将数据库集成到语义网中
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.68
D. Dou, P. LePendu, Shiwoong Kim, Peishen Qi
To realize the Semantic Web, it will be necessary to make existing database content available for emerging Semantic Web applications, such as web agents and services, which use ontologies to formally define the semantics of their data. Our research in the design and implementation of an ontology-based system, OntoGrate, addresses the critical and challenging problem of supporting human experts in multiple domains to interactively integrate information that is heterogenous in both structure and semantics. Databases, knowledge bases, the World Wide Web, and the emerging Semantic Web are some of the resources for which scalable integration remains a challenge. To integrate databases into the Semantic Web, we use Semantic Web ontologies to incorporate database schemas. An expressive first order ontology language, Web-PDDL, is used to define the structure, semantics, and mappings of data resources. A powerful inference engine, OntoEngine, can be used for query answering and data translation. In this paper, besides introducing new ideas in the OntoGrate system, we will elaborate on two case studies for which our system works well.
为了实现语义Web,有必要使现有的数据库内容可用于新兴的语义Web应用程序,例如使用本体正式定义其数据语义的Web代理和服务。我们在设计和实现基于本体的系统OntoGrate方面的研究,解决了支持多个领域的人类专家交互集成结构和语义异构的信息的关键和具有挑战性的问题。数据库、知识库、万维网和新兴的语义网是一些资源,可伸缩集成对它们来说仍然是一个挑战。为了将数据库集成到语义Web中,我们使用语义Web本体来合并数据库模式。一种富有表现力的一阶本体语言Web-PDDL用于定义数据资源的结构、语义和映射。一个强大的推理引擎,OntoEngine,可以用于查询回答和数据翻译。在本文中,除了介绍OntoGrate系统的新思想外,我们还将详细介绍两个我们的系统运行良好的案例研究。
{"title":"Integrating Databases into the Semantic Web through an Ontology-Based Framework","authors":"D. Dou, P. LePendu, Shiwoong Kim, Peishen Qi","doi":"10.1109/ICDEW.2006.68","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.68","url":null,"abstract":"To realize the Semantic Web, it will be necessary to make existing database content available for emerging Semantic Web applications, such as web agents and services, which use ontologies to formally define the semantics of their data. Our research in the design and implementation of an ontology-based system, OntoGrate, addresses the critical and challenging problem of supporting human experts in multiple domains to interactively integrate information that is heterogenous in both structure and semantics. Databases, knowledge bases, the World Wide Web, and the emerging Semantic Web are some of the resources for which scalable integration remains a challenge. To integrate databases into the Semantic Web, we use Semantic Web ontologies to incorporate database schemas. An expressive first order ontology language, Web-PDDL, is used to define the structure, semantics, and mappings of data resources. A powerful inference engine, OntoEngine, can be used for query answering and data translation. In this paper, besides introducing new ideas in the OntoGrate system, we will elaborate on two case studies for which our system works well.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130432816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 68
Your Enterprise on XQuery and XML Schema: XML-based Data and Metadata Integration 基于XQuery和XML模式的企业:基于XML的数据和元数据集成
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.167
P. Reveliotis, M. Carey
This paper describes a declarative and unifying approach, based on XQuery and XML Schema, to modeling and accessing the variety of data source types found in a typical enterprise. These include relational, Web service, function-based (a.k.a. servicebased), and file-based data sources. The approach that we detail here is based on introspection of data source meta-information and generation of metadata artifacts that conform to a common model and that provide a uniform framework in which the data sources become available to an XQuery-based query processor. We explain how our approach addresses various aspects of data sources, including data source connectivity, data typing, integrity constraints, access control, and the need for performant access. We also explain why XQuery and XML Schema can serve as an excellent vehicle for data and metadata integration. The approach described in this paper is used in BEA's AquaLogic Data Services Platform product and serves as the substrate for its data service modeling concepts.
本文描述了一种基于XQuery和XML Schema的声明性统一方法,用于对典型企业中的各种数据源类型进行建模和访问。这些数据源包括关系数据源、Web服务数据源、基于函数的数据源(又称基于服务的数据源)和基于文件的数据源。我们在这里详细介绍的方法是基于数据源元信息的自省和元数据工件的生成,这些元数据工件符合一个公共模型,并提供一个统一的框架,在这个框架中,数据源可供基于xquery的查询处理器使用。我们将解释我们的方法如何处理数据源的各个方面,包括数据源连接性、数据类型、完整性约束、访问控制以及对高性能访问的需求。我们还解释了为什么XQuery和XML Schema可以作为数据和元数据集成的优秀工具。本文描述的方法用于BEA的AquaLogic数据服务平台产品,并作为其数据服务建模概念的基础。
{"title":"Your Enterprise on XQuery and XML Schema: XML-based Data and Metadata Integration","authors":"P. Reveliotis, M. Carey","doi":"10.1109/ICDEW.2006.167","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.167","url":null,"abstract":"This paper describes a declarative and unifying approach, based on XQuery and XML Schema, to modeling and accessing the variety of data source types found in a typical enterprise. These include relational, Web service, function-based (a.k.a. servicebased), and file-based data sources. The approach that we detail here is based on introspection of data source meta-information and generation of metadata artifacts that conform to a common model and that provide a uniform framework in which the data sources become available to an XQuery-based query processor. We explain how our approach addresses various aspects of data sources, including data source connectivity, data typing, integrity constraints, access control, and the need for performant access. We also explain why XQuery and XML Schema can serve as an excellent vehicle for data and metadata integration. The approach described in this paper is used in BEA's AquaLogic Data Services Platform product and serves as the substrate for its data service modeling concepts.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126590253","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Enabling ScientificWorkflow Reuse through Structured Composition of Dataflow and Control-Flow 通过数据流和控制流的结构化组合实现科学的工作流重用
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.55
S. Bowers, Bertram Ludäscher, A. Ngu, T. Critchlow
Data-centric scientific workflows are often modeled as dataflow process networks. The simplicity of the dataflow framework facilitates workflow design, analysis, and optimization. However, modeling "control-flow intensive" tasks using dataflow constructs often leads to overly complicated workflows that are hard to comprehend, reuse, and maintain. We describe a generic framework, based on scientific workflow templates and frames, for embedding control-flow intensive subtasks within dataflow process networks. This approach can seamlessly handle complex control-flow without sacrificing the benefits of dataflow. We illustrate our approach with a real-world scientific workflow from the astrophysics domain, requiring remote execution and file transfer in a semi-reliable environment. For such workflows, we also describe a 3-layered architecture based on frames and templates where the top-layer consists of an overall dataflow process network, the second layer consists of a tranducer template for modeling the desired control-flow behavior, and the bottom layer consists of frames inside the template that are specialized by embedding the desired component implementation. Our approach can enable scientific workflows that are more robust (faulttolerance strategies can be defined by control-flow driven transducer templates) and at the same time more reusable, since the embedding of frames and templates yields more structured and modular workflow designs.
以数据为中心的科学工作流通常被建模为数据流流程网络。数据流框架的简单性有助于工作流的设计、分析和优化。然而,使用数据流构造对“控制流密集型”任务进行建模通常会导致过于复杂的工作流,难以理解、重用和维护。我们基于科学的工作流模板和框架,描述了一个通用框架,用于在数据流过程网络中嵌入控制流密集型子任务。这种方法可以无缝地处理复杂的控制流,而不会牺牲数据流的优势。我们用来自天体物理学领域的真实世界的科学工作流程来说明我们的方法,需要在半可靠的环境中远程执行和文件传输。对于这样的工作流,我们还描述了一个基于框架和模板的三层架构,其中顶层由整体数据流处理网络组成,第二层由用于建模所需控制流行为的传感器模板组成,底层由模板内的框架组成,这些框架通过嵌入所需组件实现来实现专业化。我们的方法可以使科学工作流更健壮(容错策略可以由控制流驱动的传感器模板定义),同时更可重用,因为框架和模板的嵌入产生更结构化和模块化的工作流设计。
{"title":"Enabling ScientificWorkflow Reuse through Structured Composition of Dataflow and Control-Flow","authors":"S. Bowers, Bertram Ludäscher, A. Ngu, T. Critchlow","doi":"10.1109/ICDEW.2006.55","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.55","url":null,"abstract":"Data-centric scientific workflows are often modeled as dataflow process networks. The simplicity of the dataflow framework facilitates workflow design, analysis, and optimization. However, modeling \"control-flow intensive\" tasks using dataflow constructs often leads to overly complicated workflows that are hard to comprehend, reuse, and maintain. We describe a generic framework, based on scientific workflow templates and frames, for embedding control-flow intensive subtasks within dataflow process networks. This approach can seamlessly handle complex control-flow without sacrificing the benefits of dataflow. We illustrate our approach with a real-world scientific workflow from the astrophysics domain, requiring remote execution and file transfer in a semi-reliable environment. For such workflows, we also describe a 3-layered architecture based on frames and templates where the top-layer consists of an overall dataflow process network, the second layer consists of a tranducer template for modeling the desired control-flow behavior, and the bottom layer consists of frames inside the template that are specialized by embedding the desired component implementation. Our approach can enable scientific workflows that are more robust (faulttolerance strategies can be defined by control-flow driven transducer templates) and at the same time more reusable, since the embedding of frames and templates yields more structured and modular workflow designs.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130829331","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 82
Mining Popular Paths in a Transportation Database System with Privacy Protection 具有隐私保护的交通数据库系统中流行路径的挖掘
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.91
Chi Hong Cheong, M. Wong
This paper proposes an algorithm to identify popular paths in a transportation system, while the privacy of drivers is preserved. A popular path is one of the most frequently used routes between any two points in a road map. In order to identify popular paths with privacy protection, the algorithm figures out what information is useless for identifying popular paths, and this information is not revealed to the data mining system so that privacy is preserved. In addition, the system does not record the identifications of the vehicles. Moreover, in the mining process, the database does not contain complete path information. The experimental results verify the correctness of the proposed algorithm and show that the proposed algorithm is scalable.
本文提出了一种在保证驾驶员隐私的前提下识别交通系统中常用路径的算法。热门路径是指地图上任意两点之间最常使用的路线之一。为了识别具有隐私保护的流行路径,该算法找出哪些信息对识别流行路径是无用的,这些信息不向数据挖掘系统透露,以保护隐私。此外,该系统不记录车辆的身份。此外,在挖掘过程中,数据库不包含完整的路径信息。实验结果验证了所提算法的正确性,并表明所提算法具有可扩展性。
{"title":"Mining Popular Paths in a Transportation Database System with Privacy Protection","authors":"Chi Hong Cheong, M. Wong","doi":"10.1109/ICDEW.2006.91","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.91","url":null,"abstract":"This paper proposes an algorithm to identify popular paths in a transportation system, while the privacy of drivers is preserved. A popular path is one of the most frequently used routes between any two points in a road map. In order to identify popular paths with privacy protection, the algorithm figures out what information is useless for identifying popular paths, and this information is not revealed to the data mining system so that privacy is preserved. In addition, the system does not record the identifications of the vehicles. Moreover, in the mining process, the database does not contain complete path information. The experimental results verify the correctness of the proposed algorithm and show that the proposed algorithm is scalable.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133003843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Leveraging Windows Workflow Foundation for Scientific Workflows in Wind Tunnel Applications 利用Windows工作流基础在风洞应用中的科学工作流
Pub Date : 2006-04-03 DOI: 10.1109/ICDEW.2006.71
A. Paventhan, Kenji Takeda, S. Cox, D. Nicole
Scientific and engineering experiments often produce large volumes of data that must be processed and visualised in near-realtime. An example of this, described in this paper, is microphone array processing of data from wind tunnels for aeroacoustic measurements. The overall turnaround time from data acquisition and movement, to data processing and visualization is often inhibited by factors such as manual data movement, system interoperability issues, manual resource discovery for job scheduling, and disparate physical locality between the experiment and scientist or engineer post-event. Workflow frameworks and runtimes can enable rapid composition and execution of complex scientific workflows. In this paper we explore two approaches based on Windows Workflow Foundation, a component of Microsoft WinFX. In our first approach, we present a framework for users to compose sequential workflows and access Globus grid services seamlessly using a .NET-based Commodity Grid Toolkit (MyCoG.NET). We demonstrate how application specific activity sets can be developed and extended by users. In our second approach we highlight how it can be advantageous to keep databases as central to the complete workflow enactment. These two approaches are demonstrated in the context of a wind tunnel Grid system being developed to help experimental aerodynamicists orchestrate such workflows.
科学和工程实验经常产生大量的数据,这些数据必须在接近实时的情况下进行处理和可视化。本文描述的一个例子是对风洞数据的传声器阵列处理,用于气动声学测量。从数据采集和移动到数据处理和可视化的总体周转时间通常受到一些因素的限制,例如手动数据移动、系统互操作性问题、作业调度的手动资源发现,以及实验和科学家或工程师之间的不同物理位置。工作流框架和运行时可以支持复杂的科学工作流的快速组合和执行。在本文中,我们探讨了基于Windows Workflow Foundation (Microsoft WinFX的一个组件)的两种方法。在我们的第一种方法中,我们为用户提供了一个框架,使用户可以使用基于。net的商品网格工具包(MyCoG.NET)来组合连续的工作流和无缝地访问Globus网格服务。我们将演示用户如何开发和扩展特定于应用程序的活动集。在我们的第二种方法中,我们强调了将数据库作为整个工作流制定的中心是如何有利的。这两种方法在风洞网格系统的背景下进行了演示,该系统正在开发中,以帮助实验空气动力学家协调此类工作流程。
{"title":"Leveraging Windows Workflow Foundation for Scientific Workflows in Wind Tunnel Applications","authors":"A. Paventhan, Kenji Takeda, S. Cox, D. Nicole","doi":"10.1109/ICDEW.2006.71","DOIUrl":"https://doi.org/10.1109/ICDEW.2006.71","url":null,"abstract":"Scientific and engineering experiments often produce large volumes of data that must be processed and visualised in near-realtime. An example of this, described in this paper, is microphone array processing of data from wind tunnels for aeroacoustic measurements. The overall turnaround time from data acquisition and movement, to data processing and visualization is often inhibited by factors such as manual data movement, system interoperability issues, manual resource discovery for job scheduling, and disparate physical locality between the experiment and scientist or engineer post-event. Workflow frameworks and runtimes can enable rapid composition and execution of complex scientific workflows. In this paper we explore two approaches based on Windows Workflow Foundation, a component of Microsoft WinFX. In our first approach, we present a framework for users to compose sequential workflows and access Globus grid services seamlessly using a .NET-based Commodity Grid Toolkit (MyCoG.NET). We demonstrate how application specific activity sets can be developed and extended by users. In our second approach we highlight how it can be advantageous to keep databases as central to the complete workflow enactment. These two approaches are demonstrated in the context of a wind tunnel Grid system being developed to help experimental aerodynamicists orchestrate such workflows.","PeriodicalId":331953,"journal":{"name":"22nd International Conference on Data Engineering Workshops (ICDEW'06)","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2006-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124188956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
期刊
22nd International Conference on Data Engineering Workshops (ICDEW'06)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1