首页 > 最新文献

Proceedings 11th IEEE International Symposium on High Performance Distributed Computing最新文献

英文 中文
MySRB and SRB - components of a Data Grid MySRB和SRB——数据网格的组件
A. Rajasekar, M. Wan, Reagan Moore
Data Grids are becoming increasingly important in scientific communities for sharing large data collections and for archiving and disseminating them in a digital library framework. The Storage Resource Broker (SRB) provides transparent virtualized middleware for sharing data across distributed, heterogeneous data resources separated by different administrative and security domains. The MySRB is a Web-based interface to the SRB that provides a user-friendly interface to distributed collections brokered by the SRB. In this paper we briefly describe the use of the SRB infrastructure as tools in the data grid architecture for building distributed data collections, digital libraries, and persistent archives. We also provide details about the MySRB and its functionalities.
数据网格在科学界中变得越来越重要,用于共享大型数据集合,并在数字图书馆框架中对其进行归档和传播。存储资源代理(SRB)提供透明的虚拟化中间件,用于跨由不同管理和安全域分隔的分布式异构数据资源共享数据。MySRB是一个基于web的SRB接口,它为SRB代理的分布式集合提供了一个用户友好的界面。在本文中,我们简要描述了SRB基础设施作为数据网格体系结构中的工具的使用,用于构建分布式数据集合、数字图书馆和持久档案。我们还提供了有关MySRB及其功能的详细信息。
{"title":"MySRB and SRB - components of a Data Grid","authors":"A. Rajasekar, M. Wan, Reagan Moore","doi":"10.1109/HPDC.2002.1029930","DOIUrl":"https://doi.org/10.1109/HPDC.2002.1029930","url":null,"abstract":"Data Grids are becoming increasingly important in scientific communities for sharing large data collections and for archiving and disseminating them in a digital library framework. The Storage Resource Broker (SRB) provides transparent virtualized middleware for sharing data across distributed, heterogeneous data resources separated by different administrative and security domains. The MySRB is a Web-based interface to the SRB that provides a user-friendly interface to distributed collections brokered by the SRB. In this paper we briefly describe the use of the SRB infrastructure as tools in the data grid architecture for building distributed data collections, digital libraries, and persistent archives. We also provide details about the MySRB and its functionalities.","PeriodicalId":279053,"journal":{"name":"Proceedings 11th IEEE International Symposium on High Performance Distributed Computing","volume":"45 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125146120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 68
Flexibility, manageability, and performance in a Grid storage appliance 网格存储设备中的灵活性、可管理性和性能
John Bent, Venkateshwaran Venkataramani, Nick LeRoy, A. Roy, Joseph Stanley, A. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, M. Livny
We present NeST a flexible software-only storage appliance designed to meet the storage needs of the Grid. NeST has three key features that make it well-suited for deployment in a Grid environment. First, NeST provides a generic data transfer architecture that supports multiple data transfer protocols (including GridFTP and NFS), and allows for the easy addition of new protocols. Second, NeST is dynamic, adapting itself on-the-fly so that it runs effectively on a wide range of hardware and software platforms. Third, NeST is Grid-aware, implying that features that are necessary for integration into the Grid, such as storage space guarantees, mechanisms for resource and data discovery, user authentication, and quality of service, are a part of the NeST infrastructure.
我们向NeST展示了一种灵活的纯软件存储设备,旨在满足网格的存储需求。NeST有三个关键特性,使其非常适合在网格环境中部署。首先,NeST提供了一种通用的数据传输体系结构,它支持多种数据传输协议(包括GridFTP和NFS),并允许轻松添加新协议。其次,NeST是动态的,可以随时调整自己,以便在各种硬件和软件平台上有效运行。第三,NeST是网格感知的,这意味着集成到网格中所必需的特性,如存储空间保证、资源和数据发现机制、用户身份验证和服务质量,都是NeST基础设施的一部分。
{"title":"Flexibility, manageability, and performance in a Grid storage appliance","authors":"John Bent, Venkateshwaran Venkataramani, Nick LeRoy, A. Roy, Joseph Stanley, A. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, M. Livny","doi":"10.1109/HPDC.2002.1029898","DOIUrl":"https://doi.org/10.1109/HPDC.2002.1029898","url":null,"abstract":"We present NeST a flexible software-only storage appliance designed to meet the storage needs of the Grid. NeST has three key features that make it well-suited for deployment in a Grid environment. First, NeST provides a generic data transfer architecture that supports multiple data transfer protocols (including GridFTP and NFS), and allows for the easy addition of new protocols. Second, NeST is dynamic, adapting itself on-the-fly so that it runs effectively on a wide range of hardware and software platforms. Third, NeST is Grid-aware, implying that features that are necessary for integration into the Grid, such as storage space guarantees, mechanisms for resource and data discovery, user authentication, and quality of service, are a part of the NeST infrastructure.","PeriodicalId":279053,"journal":{"name":"Proceedings 11th IEEE International Symposium on High Performance Distributed Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117151034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 77
Design and evaluation of a resource selection framework for Grid applications 网格应用程序资源选择框架的设计和评估
Chuang Liu, Lingyun Yang, Ian T Foster, D. Angulo
While distributed, heterogeneous collections of computers ("Grids") can in principle be used as a computing platform, in practice the problems of first discovering and then organizing resources to meet application requirements are difficult. We present a general-purpose resource selection framework that addresses these problems by defining a resource selection service for locating Grid resources that match application requirements. At the heart of this framework is a simple, but powerful, declarative language based on a technique called set matching, which extends the Condor matchmaking framework to support both single-resource and multiple-resource selection. This framework also provides an open interface for loading application-specific mapping modules to personalize the resource selector. We present results obtained when this framework is applied in the context of a computational astrophysics application, Cactus. These results demonstrate the effectiveness of our technique.
虽然分布式的、异构的计算机集合(“网格”)原则上可以用作计算平台,但在实践中,首先发现然后组织资源以满足应用程序需求的问题是困难的。我们提出了一个通用的资源选择框架,它通过定义一个资源选择服务来定位符合应用程序需求的网格资源,从而解决了这些问题。该框架的核心是一种简单但功能强大的声明性语言,它基于一种称为集合匹配的技术,它扩展了Condor匹配框架,以支持单资源和多资源选择。该框架还提供了一个开放接口,用于加载特定于应用程序的映射模块,以个性化资源选择器。我们介绍了将该框架应用于计算天体物理学应用Cactus时获得的结果。这些结果证明了我们的技术的有效性。
{"title":"Design and evaluation of a resource selection framework for Grid applications","authors":"Chuang Liu, Lingyun Yang, Ian T Foster, D. Angulo","doi":"10.1109/HPDC.2002.1029904","DOIUrl":"https://doi.org/10.1109/HPDC.2002.1029904","url":null,"abstract":"While distributed, heterogeneous collections of computers (\"Grids\") can in principle be used as a computing platform, in practice the problems of first discovering and then organizing resources to meet application requirements are difficult. We present a general-purpose resource selection framework that addresses these problems by defining a resource selection service for locating Grid resources that match application requirements. At the heart of this framework is a simple, but powerful, declarative language based on a technique called set matching, which extends the Condor matchmaking framework to support both single-resource and multiple-resource selection. This framework also provides an open interface for loading application-specific mapping modules to personalize the resource selector. We present results obtained when this framework is applied in the context of a computational astrophysics application, Cactus. These results demonstrate the effectiveness of our technique.","PeriodicalId":279053,"journal":{"name":"Proceedings 11th IEEE International Symposium on High Performance Distributed Computing","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125130674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 250
InfoGram: a grid service that supports both information queries and job execution InfoGram:一种网格服务,既支持信息查询,也支持作业执行
G. Laszewski, Jarek Gawor, C. Peña, Ian T Foster
The research described in this paper is performed as part of the Globus Project. It introduces a new grid service called InfoGram that combines the ability of serving as information service and as a job execution service. Previously, both services were architected and implemented within the Globus Toolkit as two different services with different wire protocols. Our service demonstrates a significant simplification of the architecture while treating job submissions and information queries alike. The advantage of our service is that it provides backwards compatibility to existing grid services, while at the same time providing forwards compatibility to the emerging Web services world. Part of the work conducted within this effort is already reused by the current open grid services architecture prototype implementation.
本文中描述的研究是作为Globus项目的一部分进行的。它引入了一个名为InfoGram的新网格服务,它结合了作为信息服务和作为作业执行服务的能力。以前,这两个服务在Globus Toolkit中被架构和实现为具有不同连接协议的两个不同服务。我们的服务在处理作业提交和信息查询时显著简化了体系结构。我们的服务的优势在于它提供了对现有网格服务的向后兼容性,同时为新兴的Web服务世界提供了向前兼容性。在这项工作中进行的部分工作已经被当前的开放网格服务架构原型实现重用。
{"title":"InfoGram: a grid service that supports both information queries and job execution","authors":"G. Laszewski, Jarek Gawor, C. Peña, Ian T Foster","doi":"10.1109/HPDC.2002.1029933","DOIUrl":"https://doi.org/10.1109/HPDC.2002.1029933","url":null,"abstract":"The research described in this paper is performed as part of the Globus Project. It introduces a new grid service called InfoGram that combines the ability of serving as information service and as a job execution service. Previously, both services were architected and implemented within the Globus Toolkit as two different services with different wire protocols. Our service demonstrates a significant simplification of the architecture while treating job submissions and information queries alike. The advantage of our service is that it provides backwards compatibility to existing grid services, while at the same time providing forwards compatibility to the emerging Web services world. Part of the work conducted within this effort is already reused by the current open grid services architecture prototype implementation.","PeriodicalId":279053,"journal":{"name":"Proceedings 11th IEEE International Symposium on High Performance Distributed Computing","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125314094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Virtual services in data grids 数据网格中的虚拟服务
A. Jagatheesan, Reagan Moore, A. Rajasekar, Bing Zhu
Data grids enable next generation scientific explorations that require intensive computation and analysis of petabyte-scale shared data collections. Apart from the challenge in creation and management of the data, another major challenge is the discovery of derived data products that have already been created. This work addresses the later challenge in minimizing response time and conserving the computing cycles in the grids by using "virtual services" to access derived data products. We analyze the role of virtual services for long running services in a service-oriented architecture and contribute a technique called "semantic mirror". A prototype was developed to discover derived data products of grid services using SRB-MCAT technology and GriPhyN Virtual Data Language (VDL).
数据网格使下一代科学探索成为可能,这些探索需要对pb级共享数据集合进行密集计算和分析。除了创建和管理数据方面的挑战之外,另一个主要挑战是发现已经创建的派生数据产品。这项工作通过使用“虚拟服务”访问衍生数据产品,解决了最小化响应时间和节省网格计算周期的后期挑战。我们分析了虚拟服务在面向服务的体系结构中长期运行的服务中的作用,并提供了一种称为“语义镜像”的技术。利用SRB-MCAT技术和GriPhyN虚拟数据语言(VDL)开发了网格服务派生数据产品的原型。
{"title":"Virtual services in data grids","authors":"A. Jagatheesan, Reagan Moore, A. Rajasekar, Bing Zhu","doi":"10.1109/HPDC.2002.1029950","DOIUrl":"https://doi.org/10.1109/HPDC.2002.1029950","url":null,"abstract":"Data grids enable next generation scientific explorations that require intensive computation and analysis of petabyte-scale shared data collections. Apart from the challenge in creation and management of the data, another major challenge is the discovery of derived data products that have already been created. This work addresses the later challenge in minimizing response time and conserving the computing cycles in the grids by using \"virtual services\" to access derived data products. We analyze the role of virtual services for long running services in a service-oriented architecture and contribute a technique called \"semantic mirror\". A prototype was developed to discover derived data products of grid services using SRB-MCAT technology and GriPhyN Virtual Data Language (VDL).","PeriodicalId":279053,"journal":{"name":"Proceedings 11th IEEE International Symposium on High Performance Distributed Computing","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122054132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Backtracking and re-execution in the automatic debugging of parallelized programs 并行程序自动调试中的回溯和重新执行
G. Matthews, R. Hood, Stephen Johnson, P. Leggett
In this work we describe a new approach using relative debugging to find differences in computation between a serial program and a parallel version of that program. We use a combination of re-execution and backtracking in order to find the first difference in computation that may ultimately lead to an incorrect value that the user has indicated. In our prototype implementation we use static analysis information from a parallelization tool in order to perform the backtracking as well as the mapping required between serial and parallel computations.
在这项工作中,我们描述了一种新的方法,使用相对调试来发现串行程序和该程序的并行版本之间的计算差异。我们使用重新执行和回溯的组合,以便找到可能最终导致用户所指示的不正确值的计算中的第一个差异。在我们的原型实现中,我们使用来自并行化工具的静态分析信息来执行回溯以及串行和并行计算之间所需的映射。
{"title":"Backtracking and re-execution in the automatic debugging of parallelized programs","authors":"G. Matthews, R. Hood, Stephen Johnson, P. Leggett","doi":"10.1109/HPDC.2002.1029913","DOIUrl":"https://doi.org/10.1109/HPDC.2002.1029913","url":null,"abstract":"In this work we describe a new approach using relative debugging to find differences in computation between a serial program and a parallel version of that program. We use a combination of re-execution and backtracking in order to find the first difference in computation that may ultimately lead to an incorrect value that the user has indicated. In our prototype implementation we use static analysis information from a parallelization tool in order to perform the backtracking as well as the mapping required between serial and parallel computations.","PeriodicalId":279053,"journal":{"name":"Proceedings 11th IEEE International Symposium on High Performance Distributed Computing","volume":"119 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133852289","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
The performance of processor co-allocation in multicluster systems 多集群系统中处理器协同分配的性能
A. Bucur, D. Epema
In systems consisting of multiple clusters of processors interconnected by relatively slow communication links, co-allocation may be required. We study its performance by means of simulations, depending on the structure and sizes of jobs, and the communication speed ratio. We model a multicluster with C clusters of identical processors. The workload consists of rigid jobs that require fixed numbers of processors, possibly in multiple clusters simultaneously. A job is represented by a tuple of C values, each generated from a same distribution D. In an ordered request the positions of the components in the tuple specify the clusters from which the processors must be allocated. For an unordered request, by the components of the tuple the job only specifies the numbers of processors needed in the separate clusters. A flexible request specifies the total number of processors, obtained as the sum of the values in the tuple. For total requests, there is a single cluster and a request only specifies the total number of processors needed. All intracluster communication links have the same speed, as do all intercluster links. The communication speed ratio is the ratio between the time needed to complete a send operation between processors in different clusters and in the same cluster.
在由多个处理器集群组成的系统中,通过相对较慢的通信链路相互连接,可能需要共同分配。我们通过仿真研究了它的性能,根据作业的结构和大小,以及通信速比。我们建立了一个多集群模型,其中C个集群具有相同的处理器。工作负载由刚性作业组成,这些作业需要固定数量的处理器,可能同时在多个集群中。作业由C个值组成的元组表示,每个值都从相同的分布d中生成。在有序请求中,元组中组件的位置指定必须从其中分配处理器的集群。对于无序请求,通过元组的组件,作业仅指定单独集群中所需的处理器数量。灵活的请求指定处理器的总数,以元组中值的总和获得。对于总请求,只有一个集群,请求只指定所需的处理器总数。所有集群内通信链路和集群间通信链路具有相同的速度。通信速比是在不同集群和同一集群中的处理器之间完成发送操作所需的时间之比。
{"title":"The performance of processor co-allocation in multicluster systems","authors":"A. Bucur, D. Epema","doi":"10.1109/HPDC.2002.1029944","DOIUrl":"https://doi.org/10.1109/HPDC.2002.1029944","url":null,"abstract":"In systems consisting of multiple clusters of processors interconnected by relatively slow communication links, co-allocation may be required. We study its performance by means of simulations, depending on the structure and sizes of jobs, and the communication speed ratio. We model a multicluster with C clusters of identical processors. The workload consists of rigid jobs that require fixed numbers of processors, possibly in multiple clusters simultaneously. A job is represented by a tuple of C values, each generated from a same distribution D. In an ordered request the positions of the components in the tuple specify the clusters from which the processors must be allocated. For an unordered request, by the components of the tuple the job only specifies the numbers of processors needed in the separate clusters. A flexible request specifies the total number of processors, obtained as the sum of the values in the tuple. For total requests, there is a single cluster and a request only specifies the total number of processors needed. All intracluster communication links have the same speed, as do all intercluster links. The communication speed ratio is the ratio between the time needed to complete a send operation between processors in different clusters and in the same cluster.","PeriodicalId":279053,"journal":{"name":"Proceedings 11th IEEE International Symposium on High Performance Distributed Computing","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127251751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Distributed computing with load-managed active storage 具有负载管理的活动存储的分布式计算
Rajiv Wickremesinghe, J. Chase, J. Vitter
One approach to high-performance processing of massive data sets is to incorporate computation into storage systems. Previous work has shown that this active storage model is effective for a variety of problems. This paper explores opportunities to use active storage as a basis for exploiting asymmetric parallelism in applications using a streaming computation model on collections of fixed-size records. This model is the basis for much of the research in I/O-efficient algorithms, which deals with an important class of massive data problems not studied in previous work on active storage. We present an extension of a streaming computation model for an external memory toolkit to support a flexible mapping of computations to storage-based processors. Our approach enables load-managed active storage: it exposes parallelism, ordering constraints, and primitive computation units to the system, which can configure the application to balance load and make the best use of available processing power Emulation results from a sorting application demonstrate the potential of dynamic adaptation in load-managed active storage.
对海量数据集进行高性能处理的一种方法是将计算集成到存储系统中。以前的工作表明,这种主动存储模型对各种问题都是有效的。本文探讨了在使用固定大小记录集合的流计算模型的应用程序中使用主动存储作为利用非对称并行性的基础的机会。这个模型是很多I/ o高效算法研究的基础,这些算法处理的是一类重要的海量数据问题,这些问题在以前的主动存储工作中没有研究过。我们提出了一个外部内存工具包的流计算模型的扩展,以支持计算到基于存储的处理器的灵活映射。我们的方法支持负载管理的活动存储:它向系统公开了并行性、排序约束和基本计算单元,系统可以配置应用程序来平衡负载并充分利用可用的处理能力,排序应用程序的仿真结果证明了负载管理的活动存储中动态适应的潜力。
{"title":"Distributed computing with load-managed active storage","authors":"Rajiv Wickremesinghe, J. Chase, J. Vitter","doi":"10.1109/HPDC.2002.1029899","DOIUrl":"https://doi.org/10.1109/HPDC.2002.1029899","url":null,"abstract":"One approach to high-performance processing of massive data sets is to incorporate computation into storage systems. Previous work has shown that this active storage model is effective for a variety of problems. This paper explores opportunities to use active storage as a basis for exploiting asymmetric parallelism in applications using a streaming computation model on collections of fixed-size records. This model is the basis for much of the research in I/O-efficient algorithms, which deals with an important class of massive data problems not studied in previous work on active storage. We present an extension of a streaming computation model for an external memory toolkit to support a flexible mapping of computations to storage-based processors. Our approach enables load-managed active storage: it exposes parallelism, ordering constraints, and primitive computation units to the system, which can configure the application to balance load and make the best use of available processing power Emulation results from a sorting application demonstrate the potential of dynamic adaptation in load-managed active storage.","PeriodicalId":279053,"journal":{"name":"Proceedings 11th IEEE International Symposium on High Performance Distributed Computing","volume":"36 1-2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114046745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 47
Scalability of a distributed neural information retrieval system 分布式神经信息检索系统的可扩展性
M. Weeks, Victoria J. Hodge, J. Austin
Summary form only given. AURA (Advanced Uncertain Reasoning Architecture) is a generic family of techniques and implementations intended for high-speed approximate search and match operations on large unstructured datasets. AURA technology is fast, economical, and offers unique advantages for finding near-matches not available with other methods. AURA is based upon a high-performance binary neural network called a correlation matrix memory (CMM). Typically, several CMM elements are used in combination to solve soft or fuzzy pattern-matching problems. AURA takes large volumes of data and constructs a special type of compressed index. AURA finds exact and near-matches between indexed records and a given query, where the query itself may have omissions and errors. The degree of nearness required during matching can be varied through thresholding techniques. The PCI-based PRESENCE (Parallel Structured Neural Computing Engine) card is a hardware-accelerator architecture for the core CMM computations needed in AURA-based applications. The card is designed for use in low-cost workstations and incorporates 128 MByte of low-cost DRAM for CMM storage. To investigate the scalability of the distributed AURA system, we implement a word-to-document index of an AURA-based information retrieval system, called MinerTaur, over a distributed PRESENCE CMM.
只提供摘要形式。AURA(高级不确定推理体系结构)是一个通用的技术和实现家族,旨在对大型非结构化数据集进行高速近似搜索和匹配操作。AURA技术快速、经济,在寻找其他方法无法找到的接近匹配物方面具有独特的优势。AURA基于一种称为相关矩阵存储器(CMM)的高性能二进制神经网络。通常,几个CMM元素组合使用来解决软或模糊模式匹配问题。AURA需要大量数据,并构建一种特殊类型的压缩索引。AURA查找索引记录和给定查询之间的精确匹配和接近匹配,其中查询本身可能有遗漏和错误。匹配过程中所需的接近程度可以通过阈值技术来改变。基于pci的PRESENCE(并行结构化神经计算引擎)卡是一种硬件加速器架构,用于基于aura的应用程序所需的核心CMM计算。该卡设计用于低成本工作站,并包含128 MByte的低成本DRAM用于CMM存储。为了研究分布式AURA系统的可扩展性,我们在分布式PRESENCE CMM上实现了一个基于AURA的信息检索系统(称为MinerTaur)的word-to-document索引。
{"title":"Scalability of a distributed neural information retrieval system","authors":"M. Weeks, Victoria J. Hodge, J. Austin","doi":"10.1109/HPDC.2002.1029953","DOIUrl":"https://doi.org/10.1109/HPDC.2002.1029953","url":null,"abstract":"Summary form only given. AURA (Advanced Uncertain Reasoning Architecture) is a generic family of techniques and implementations intended for high-speed approximate search and match operations on large unstructured datasets. AURA technology is fast, economical, and offers unique advantages for finding near-matches not available with other methods. AURA is based upon a high-performance binary neural network called a correlation matrix memory (CMM). Typically, several CMM elements are used in combination to solve soft or fuzzy pattern-matching problems. AURA takes large volumes of data and constructs a special type of compressed index. AURA finds exact and near-matches between indexed records and a given query, where the query itself may have omissions and errors. The degree of nearness required during matching can be varied through thresholding techniques. The PCI-based PRESENCE (Parallel Structured Neural Computing Engine) card is a hardware-accelerator architecture for the core CMM computations needed in AURA-based applications. The card is designed for use in low-cost workstations and incorporates 128 MByte of low-cost DRAM for CMM storage. To investigate the scalability of the distributed AURA system, we implement a word-to-document index of an AURA-based information retrieval system, called MinerTaur, over a distributed PRESENCE CMM.","PeriodicalId":279053,"journal":{"name":"Proceedings 11th IEEE International Symposium on High Performance Distributed Computing","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122863096","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A case for proactivity in directory services 目录服务中主动性的一个案例
F. Bustamante, Patrick M. Widener, K. Schwan
In this paper, we argue that an exclusively inactive interface to directory services can hinder server scalability and indirectly restrict the behavior of potential applications. We propose to extend directory services' interfaces with a proactive mode by which clients can express their interest in (and be notified of) changes in the environment. These notification channels can be subsequently customized on a per-client basis through client-specified filters. Finally, in order to simplify the handling of client/server failures we adopt a leasing model for client registration to (and customization of) a notification channel. To validate our approach, we have designed and implemented the Proactive Directory Service (PDS).
在本文中,我们认为目录服务的专用非活动接口会阻碍服务器的可伸缩性,并间接限制潜在应用程序的行为。我们建议用一种主动模式扩展目录服务的接口,通过这种模式,客户端可以表达他们对环境变化的兴趣(并被通知)。随后,可以通过客户端指定的过滤器在每个客户端基础上定制这些通知通道。最后,为了简化客户端/服务器故障的处理,我们采用了一种租赁模型,用于客户端注册(和定制)通知通道。为了验证我们的方法,我们设计并实现了主动目录服务(PDS)。
{"title":"A case for proactivity in directory services","authors":"F. Bustamante, Patrick M. Widener, K. Schwan","doi":"10.1109/HPDC.2002.1029945","DOIUrl":"https://doi.org/10.1109/HPDC.2002.1029945","url":null,"abstract":"In this paper, we argue that an exclusively inactive interface to directory services can hinder server scalability and indirectly restrict the behavior of potential applications. We propose to extend directory services' interfaces with a proactive mode by which clients can express their interest in (and be notified of) changes in the environment. These notification channels can be subsequently customized on a per-client basis through client-specified filters. Finally, in order to simplify the handling of client/server failures we adopt a leasing model for client registration to (and customization of) a notification channel. To validate our approach, we have designed and implemented the Proactive Directory Service (PDS).","PeriodicalId":279053,"journal":{"name":"Proceedings 11th IEEE International Symposium on High Performance Distributed Computing","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122875628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings 11th IEEE International Symposium on High Performance Distributed Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1