首页 > 最新文献

CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.最新文献

英文 中文
Multi-class applications for parallel usage of a guaranteed rate and a scavenger service 多类应用程序并行使用的保证率和清道夫服务
M. Fidler, V. Sander
Grid computing requires network services beyond what is currently provided by the Best-Effort Internet. Among the different approaches towards network Quality of Service, aggregate scheduling, which maps micro-flows to a small number of different service classes, offers sufficient scalability up to the size of the Internet. The Differentiated Services architecture of the Internet Engineering Task Force is such an approach. Recently further Best-Effort like services have been proposed that are based on aggregate scheduling, like the less than Best-Effort Scavenger Service and the Alternative Best-Effort Service. In this paper we introduce the existing aggregate based approaches to Quality of Service. We take a heterogeneous mix of Transmission Control Protocol and User Datagram Protocol flows into account and complement services by adding transport protocol specific elements, if appropriate. In particular we show multi-class applications that are designed to apply a Guaranteed Rate Service and a Scavenger Service in parallel. Doing so we can show a performance gain and achieve a more economical use of the available resources without impacting responsive Best-Effort flows.
网格计算需要的网络服务超出了目前由Best-Effort Internet提供的服务。在实现网络服务质量的不同方法中,聚合调度(它将微流映射到少量不同的服务类别)提供了足够的可伸缩性,可以满足Internet的大小。Internet工程任务组的差异化服务体系结构就是这样一种方法。最近,进一步提出了基于聚合调度的类最佳努力服务,如less Best-Effort Scavenger Service和Alternative Best-Effort Service。本文介绍了现有的基于聚合的服务质量方法。我们考虑了传输控制协议和用户数据报协议流的异构混合,并在适当的情况下通过添加特定于传输协议的元素来补充服务。特别地,我们展示了多类应用程序,这些应用程序被设计为并行应用保证费率服务和清道夫服务。这样做,我们可以显示性能的提高,并在不影响响应性最佳努力流的情况下更经济地使用可用资源。
{"title":"Multi-class applications for parallel usage of a guaranteed rate and a scavenger service","authors":"M. Fidler, V. Sander","doi":"10.1109/CCGRID.2003.1199413","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199413","url":null,"abstract":"Grid computing requires network services beyond what is currently provided by the Best-Effort Internet. Among the different approaches towards network Quality of Service, aggregate scheduling, which maps micro-flows to a small number of different service classes, offers sufficient scalability up to the size of the Internet. The Differentiated Services architecture of the Internet Engineering Task Force is such an approach. Recently further Best-Effort like services have been proposed that are based on aggregate scheduling, like the less than Best-Effort Scavenger Service and the Alternative Best-Effort Service. In this paper we introduce the existing aggregate based approaches to Quality of Service. We take a heterogeneous mix of Transmission Control Protocol and User Datagram Protocol flows into account and complement services by adding transport protocol specific elements, if appropriate. In particular we show multi-class applications that are designed to apply a Guaranteed Rate Service and a Scavenger Service in parallel. Doing so we can show a performance gain and achieve a more economical use of the available resources without impacting responsive Best-Effort flows.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121556913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Towards a framework for collaborative peer groups 建立协作同伴小组的框架
V. Sunderam, James S. Pascoe, R. Loader
We propose the notion of 'collaborative peer groups', defined as peer-to-peer overlay networks with controlled membership and multiway communication primitives that offer well-defined semantics. Peers join such groups subject to symmetric acceptance, typically based on functional commonalities and, optionally, group-specific authentication. Collaborative peer group networks share the same properties as other peer-to-peer networks, including full decentralization, symmetric abilities, and dynamism. In addition, however, an extensible set of multiway communication primitives, especially appropriate for such peer groups, is provided and supports operations such as reliable message delivery to proximal group members or a subset thereof, message aggregation from peers, and discovery of peers supporting specific functional attributes. Based on several current and emerging application scenarios, we motivate and present the proposed collaborative peer group model, outline the group management architecture, and describe the initial set of communication primitives to be supported. A discussion of the toolkit development methodology and preliminary experiences concludes the paper.
我们提出了“协作对等组”的概念,将其定义为具有受控成员和提供良好定义语义的多路通信原语的对等覆盖网络。对等点以对称接受的方式加入这样的组,通常基于功能共性和(可选的)特定于组的身份验证。协作对等组网络与其他对等网络共享相同的属性,包括完全去中心化、对称能力和动态性。此外,还提供了一组可扩展的多路通信原语,特别适用于这样的对等组,并支持诸如向最近的组成员或其子集可靠地传递消息、来自对等点的消息聚合以及发现支持特定功能属性的对等点等操作。基于几个当前和新兴的应用场景,我们提出了建议的协作对等组模型,概述了组管理体系结构,并描述了要支持的初始通信原语集。对工具包开发方法和初步经验的讨论结束了本文。
{"title":"Towards a framework for collaborative peer groups","authors":"V. Sunderam, James S. Pascoe, R. Loader","doi":"10.1109/CCGRID.2003.1199397","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199397","url":null,"abstract":"We propose the notion of 'collaborative peer groups', defined as peer-to-peer overlay networks with controlled membership and multiway communication primitives that offer well-defined semantics. Peers join such groups subject to symmetric acceptance, typically based on functional commonalities and, optionally, group-specific authentication. Collaborative peer group networks share the same properties as other peer-to-peer networks, including full decentralization, symmetric abilities, and dynamism. In addition, however, an extensible set of multiway communication primitives, especially appropriate for such peer groups, is provided and supports operations such as reliable message delivery to proximal group members or a subset thereof, message aggregation from peers, and discovery of peers supporting specific functional attributes. Based on several current and emerging application scenarios, we motivate and present the proposed collaborative peer group model, outline the group management architecture, and describe the initial set of communication primitives to be supported. A discussion of the toolkit development methodology and preliminary experiences concludes the paper.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122140486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
Improving performance via computational replication on a large-scale computational grid 通过大规模计算网格上的计算复制提高性能
Yaohang Li, M. Mascagni
High performance computing on a large-scale computational grid is complicated by the heterogeneous computational capabilities of each node, node unavailability, and unreliable network connectivity. Replicating computation on multiple nodes can significantly improve performance by reducing task completion time on a grid's dynamic environment. We develop an analytical model to determine the number of task replicas to meet the performance goals in different computational grid configurations. Furthermore, taking advantage of the statistical nature of grid-based Monte Carlo applications, we extend the computational replication technique to an N-out-of-M scheduling strategy for grid-based Monte Carlo applications, which can potentially form a large category of grid-computing applications. In addition, we establish a corresponding model for the N-out-of-M scheduling mechanism. Simulations are used to validate the computational replication models. Our preliminary results show that the models we use are effective in predicting the required number of replicas to achieve short task completion time with a given high probability.
大规模计算网格上的高性能计算由于每个节点的异构计算能力、节点不可用性和不可靠的网络连接而变得复杂。在多个节点上复制计算可以通过减少网格动态环境中的任务完成时间来显著提高性能。我们开发了一个分析模型来确定任务副本的数量,以满足不同计算网格配置下的性能目标。此外,利用基于网格的蒙特卡罗应用程序的统计特性,我们将计算复制技术扩展到基于网格的蒙特卡罗应用程序的n -out- m调度策略,这可能会形成一个大型的网格计算应用程序类别。此外,我们还建立了n -out- m调度机制的相应模型。仿真用于验证计算复制模型。我们的初步结果表明,我们使用的模型在预测所需的副本数量以实现给定高概率的短任务完成时间方面是有效的。
{"title":"Improving performance via computational replication on a large-scale computational grid","authors":"Yaohang Li, M. Mascagni","doi":"10.1109/CCGRID.2003.1199399","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199399","url":null,"abstract":"High performance computing on a large-scale computational grid is complicated by the heterogeneous computational capabilities of each node, node unavailability, and unreliable network connectivity. Replicating computation on multiple nodes can significantly improve performance by reducing task completion time on a grid's dynamic environment. We develop an analytical model to determine the number of task replicas to meet the performance goals in different computational grid configurations. Furthermore, taking advantage of the statistical nature of grid-based Monte Carlo applications, we extend the computational replication technique to an N-out-of-M scheduling strategy for grid-based Monte Carlo applications, which can potentially form a large category of grid-computing applications. In addition, we establish a corresponding model for the N-out-of-M scheduling mechanism. Simulations are used to validate the computational replication models. Our preliminary results show that the models we use are effective in predicting the required number of replicas to achieve short task completion time with a given high probability.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"317 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116363978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 58
Cluster infrastructure for biological and health related research 生物和健康相关研究的集群基础设施
Sophia Corsava, V. Getov
Researchers in the biological and health industries need powerful and stable systems for their work. These systems must be dependable, fault-tolerant, highly available and easy to use. To cope with these demands we propose the use of computational and data clusters in a fail-over configuration combined with the grid technology and job scheduling. Our infrastructure has been deployed successfully for running time-critical applications in commercial environments. We also present experimental results from this pilot implementation that demonstrate the viability of our approach.
生物和健康行业的研究人员需要强大而稳定的系统来进行工作。这些系统必须可靠、容错、高可用性和易于使用。为了应对这些需求,我们提出在故障转移配置中使用计算和数据集群,并结合网格技术和作业调度。我们的基础设施已经成功部署,用于在商业环境中运行时间关键型应用程序。我们还介绍了这个试点实施的实验结果,证明了我们的方法的可行性。
{"title":"Cluster infrastructure for biological and health related research","authors":"Sophia Corsava, V. Getov","doi":"10.1109/CCGRID.2003.1199416","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199416","url":null,"abstract":"Researchers in the biological and health industries need powerful and stable systems for their work. These systems must be dependable, fault-tolerant, highly available and easy to use. To cope with these demands we propose the use of computational and data clusters in a fail-over configuration combined with the grid technology and job scheduling. Our infrastructure has been deployed successfully for running time-critical applications in commercial environments. We also present experimental results from this pilot implementation that demonstrate the viability of our approach.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115551329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design and implementation of PVFS-PM: a cluster file system on SCore 基于SCore的集群文件系统PVFS-PM的设计与实现
Koji Segawa, O. Tatebe, Yuetsu Kodama, T. Kudoh, T. Shimizu
This paper discusses the design and implementation of a cluster file system, called PVFS-PM, on the SCore cluster system software. This is the first attempt to implement a cluster file system on the SCore system. It is based on the PVFS cluster file system but replaces TCP with the PMv2 communication library supported by SCore to provide a scalable, high-performance cluster file system. PVFS-PM improves the performance by factors of 1.07 and 1.93 for writing and reading, respectively, with 8 I/O nodes, compared with the original PVFS on TCP on a Gigabit Ethernet-connected SCore cluster.
本文讨论了一个基于SCore集群系统软件的集群文件系统PVFS-PM的设计与实现。这是在SCore系统上实现集群文件系统的第一次尝试。它基于PVFS集群文件系统,但用SCore支持的PMv2通信库取代了TCP,以提供可扩展的高性能集群文件系统。与在千兆以太网连接的SCore集群上基于TCP的原始PVFS相比,PVFS- pm在8个I/O节点上的写和读性能分别提高了1.07和1.93倍。
{"title":"Design and implementation of PVFS-PM: a cluster file system on SCore","authors":"Koji Segawa, O. Tatebe, Yuetsu Kodama, T. Kudoh, T. Shimizu","doi":"10.1109/CCGRID.2003.1199436","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199436","url":null,"abstract":"This paper discusses the design and implementation of a cluster file system, called PVFS-PM, on the SCore cluster system software. This is the first attempt to implement a cluster file system on the SCore system. It is based on the PVFS cluster file system but replaces TCP with the PMv2 communication library supported by SCore to provide a scalable, high-performance cluster file system. PVFS-PM improves the performance by factors of 1.07 and 1.93 for writing and reading, respectively, with 8 I/O nodes, compared with the original PVFS on TCP on a Gigabit Ethernet-connected SCore cluster.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"135 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123493364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Building a performance model of streaming media applications in utility data center environment 建立了公用事业数据中心环境下流媒体应用的性能模型
L. Cherkasova, Loren Staley
Utility Data Center (UDC) provides a flexible, cost-effective infrastructure to support the hosting of applications for Internet services. In order to enable the design of a "utility-aware" streaming media service which automatically requests the necessary resources from UDC infrastructure, we introduce a set of benchmarks for measuring the basic capacities of streaming media systems. The benchmarks allow one to derive the scaling rules of server capacity for delivering media files which are: i) encoded at different bit rates, ii) streamed from memory vs disk. Using an experimental testbed, we show that these scaling rules are non-trivial. In this paper, we develop a workload-aware, media server performance model which is based on a cost function derived from the set of basic benchmark measurements. We validate this performance model by comparing the predicted and measured media server capacities for a set of synthetic workloads.
公用事业数据中心(Utility Data Center, UDC)提供了一种灵活的、经济高效的基础设施,以支持Internet服务的应用程序托管。为了实现“实用感知”流媒体服务的设计,该服务能够自动从UDC基础设施请求必要的资源,我们引入了一组基准来测量流媒体系统的基本容量。这些基准测试允许我们推导出传输媒体文件的服务器容量的缩放规则:i)以不同的比特率编码,ii)从内存和磁盘传输。通过一个实验平台,我们证明了这些缩放规则是非平凡的。在本文中,我们开发了一个工作负载感知的媒体服务器性能模型,该模型基于从基本基准测量集派生的成本函数。我们通过比较一组合成工作负载的预测和测量的媒体服务器容量来验证此性能模型。
{"title":"Building a performance model of streaming media applications in utility data center environment","authors":"L. Cherkasova, Loren Staley","doi":"10.1109/CCGRID.2003.1199352","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199352","url":null,"abstract":"Utility Data Center (UDC) provides a flexible, cost-effective infrastructure to support the hosting of applications for Internet services. In order to enable the design of a \"utility-aware\" streaming media service which automatically requests the necessary resources from UDC infrastructure, we introduce a set of benchmarks for measuring the basic capacities of streaming media systems. The benchmarks allow one to derive the scaling rules of server capacity for delivering media files which are: i) encoded at different bit rates, ii) streamed from memory vs disk. Using an experimental testbed, we show that these scaling rules are non-trivial. In this paper, we develop a workload-aware, media server performance model which is based on a cost function derived from the set of basic benchmark measurements. We validate this performance model by comparing the predicted and measured media server capacities for a set of synthetic workloads.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"307 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121262795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
P2P-RPC: programming scientific applications on peer-to-peer systems with remote procedure call P2P-RPC:在具有远程过程调用的对等系统上编程科学应用程序
Samir Djilali
This paper presents design and implementation of a remote Procedure call (RPC) API for programming applications on Peer-to-Peer environments. The P2P-RPC API is designed to address one of neglected aspect of Peer-to-Peer the lack of a simple programming interface. In this paper we examine one concrete implementation of the P2P-RPC-API derived from OmniRPC (an existing RPC API for the Grid based on Ninf system). This new API is implemented on top of low-level functionalities of the XtremWeb Peer-to-Peer Computing System. The minimal API defined in this paper provides a basic mechanism to make migrate a wide variety of applications using RPC mechanism to the Peer-to-Peer systems. We evaluate P2P-RPC for a numerical application (NAS EP Benchmark) and demonstrate its performance and fault tolerance properties.
本文提出了一个用于点对点环境下编程应用程序的远程过程调用(RPC) API的设计和实现。P2P-RPC API旨在解决点对点的一个被忽视的方面:缺乏简单的编程接口。在本文中,我们研究了源自OmniRPC(基于Ninf系统的现有网格RPC API)的P2P-RPC-API的一个具体实现。这个新的API是在XtremWeb对等计算系统的底层功能之上实现的。本文定义的最小API提供了一种基本机制,可以将使用RPC机制的各种应用程序迁移到对等系统。我们评估了P2P-RPC的数值应用程序(NAS EP基准),并展示了它的性能和容错特性。
{"title":"P2P-RPC: programming scientific applications on peer-to-peer systems with remote procedure call","authors":"Samir Djilali","doi":"10.1109/CCGRID.2003.1199394","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199394","url":null,"abstract":"This paper presents design and implementation of a remote Procedure call (RPC) API for programming applications on Peer-to-Peer environments. The P2P-RPC API is designed to address one of neglected aspect of Peer-to-Peer the lack of a simple programming interface. In this paper we examine one concrete implementation of the P2P-RPC-API derived from OmniRPC (an existing RPC API for the Grid based on Ninf system). This new API is implemented on top of low-level functionalities of the XtremWeb Peer-to-Peer Computing System. The minimal API defined in this paper provides a basic mechanism to make migrate a wide variety of applications using RPC mechanism to the Peer-to-Peer systems. We evaluate P2P-RPC for a numerical application (NAS EP Benchmark) and demonstrate its performance and fault tolerance properties.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121364145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Evaluation of the inter-cluster data transfer on Grid environment 网格环境下集群间数据传输的评估
Shoji Ogura, S. Matsuoka, H. Nakada
High-performance peer-to-peer transfer between clusters will be fundamental technology base for various Grid middleware, such as large-scale data transfer in DataGrid settings, or collective communication in Grid-wide MPIs. There, two major factors are involved: on one hand network pipes with large RTT /spl times/ bandwidth typically become data-starved, resulting in bandwidth loss; on the other hand when multiple nodes on the clusters attempt simultaneous transfer, the network pipe could become saturated, resulting in packet loss which again may result in bandwidth degradation in large RTT /spl times/ bandwidth networks. By dynamically and automatically adjusting transfer parameters between the two clusters, such as the number of network nodes, number of socket stripes, we could achieve optimal bandwidth even when the network is under heavy contention. In order to arrive at a proper performance model for automated adjustment, we have conducted several simulations by which we have discovered that such automatic tuning would beneficial, but the ideal number of network pipes does not exactly match the simple transfer model of traditional peer-to-peer settings between single nodes.
集群之间的高性能点对点传输将成为各种网格中间件的基础技术,例如DataGrid设置中的大规模数据传输,或网格范围MPIs中的集体通信。这里涉及两个主要因素:一方面,具有大RTT /spl时间/带宽的网络管道通常会变得数据匮乏,导致带宽损失;另一方面,当集群上的多个节点尝试同时传输时,网络管道可能会饱和,导致数据包丢失,这再次可能导致大RTT /spl次数/带宽网络中的带宽下降。通过动态自动调整两个集群之间的传输参数,如网络节点数、套接字条带数等,即使在网络竞争激烈的情况下,也能实现最优带宽。为了得到一个适合自动调整的性能模型,我们进行了几次模拟,通过这些模拟,我们发现这种自动调整是有益的,但是理想的网络管道数量并不完全匹配单个节点之间传统点对点设置的简单传输模型。
{"title":"Evaluation of the inter-cluster data transfer on Grid environment","authors":"Shoji Ogura, S. Matsuoka, H. Nakada","doi":"10.1109/CCGRID.2003.1199390","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199390","url":null,"abstract":"High-performance peer-to-peer transfer between clusters will be fundamental technology base for various Grid middleware, such as large-scale data transfer in DataGrid settings, or collective communication in Grid-wide MPIs. There, two major factors are involved: on one hand network pipes with large RTT /spl times/ bandwidth typically become data-starved, resulting in bandwidth loss; on the other hand when multiple nodes on the clusters attempt simultaneous transfer, the network pipe could become saturated, resulting in packet loss which again may result in bandwidth degradation in large RTT /spl times/ bandwidth networks. By dynamically and automatically adjusting transfer parameters between the two clusters, such as the number of network nodes, number of socket stripes, we could achieve optimal bandwidth even when the network is under heavy contention. In order to arrive at a proper performance model for automated adjustment, we have conducted several simulations by which we have discovered that such automatic tuning would beneficial, but the ideal number of network pipes does not exactly match the simple transfer model of traditional peer-to-peer settings between single nodes.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123644639","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Combining task- and data parallelism to speed up protein folding on a desktop grid platform 结合任务和数据并行性来加速桌面网格平台上的蛋白质折叠
Bennet Uk, M. Taufer, T. Stricker, G. Settanni, A. Cavalli, A. Caflisch
The steady increase of computing power at lower and lower cost enables molecular dynamics simulations to investigate the process of protein folding with an explicit treatment of water molecules. Such simulations are typically done with well known computational chemistry codes like CHARMM. Desktop grids such as the United Devices MetaProcessor are highly attractive platforms, since scavenging for unused machines on Intra- and Internet delivers compute power that is almost free. However, the predominant programming paradigm for current desktop grids is pure task parallelism and might not fit the needs for protein folding simulations with explicit water molecules. A short overall turn-around time of a simulation remains highly important for research productivity, but the need for an accurate model and long simulation time-scales leads to tasks that are too large for optimal scheduling on a desktop grid. To address this problem, we introduce a combination of task- and data parallelism as a well suitable computing paradigm for protein folding investigations on grid platforms. As a proof of concept, we design and implement a simple system for protein folding simulations based on the notion of combined task and data parallelism with clustered workers. Clustered workers are machines grouped into small clusters according to network and CPU performance criteria and act as super-nodes within a desktop grid, permitting the utilization of data parallelism in addition to the task parallelism. We integrate our new paradigm into the existing software environment of the United Devices MetaProcessor. For a test protein, we reach a better quality of the folding calculations than we reached using just task parallelism on distributed systems.
以越来越低的成本稳定增长的计算能力使分子动力学模拟能够研究蛋白质折叠过程与水分子的明确处理。这种模拟通常是用CHARMM等众所周知的计算化学代码完成的。像United Devices MetaProcessor这样的桌面网格是非常有吸引力的平台,因为在Intra和Internet上清理未使用的机器可以提供几乎免费的计算能力。然而,当前桌面网格的主要编程范式是纯粹的任务并行,可能不适合具有明确水分子的蛋白质折叠模拟的需要。模拟的短总体周转时间对于研究效率仍然非常重要,但对精确模型和长模拟时间尺度的需求导致任务过于庞大,无法在桌面网格上进行最佳调度。为了解决这个问题,我们引入了任务并行和数据并行的组合,作为网格平台上蛋白质折叠研究的一个非常合适的计算范式。作为概念验证,我们设计并实现了一个简单的蛋白质折叠模拟系统,该系统基于任务和数据并行化与集群工人的组合概念。集群工人是根据网络和CPU性能标准分组成小集群的机器,充当桌面网格中的超级节点,除了任务并行性之外,还允许利用数据并行性。我们将我们的新范例集成到United Devices元处理器的现有软件环境中。对于测试蛋白,我们达到了比在分布式系统上使用任务并行性更好的折叠计算质量。
{"title":"Combining task- and data parallelism to speed up protein folding on a desktop grid platform","authors":"Bennet Uk, M. Taufer, T. Stricker, G. Settanni, A. Cavalli, A. Caflisch","doi":"10.1109/CCGRID.2003.1199374","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199374","url":null,"abstract":"The steady increase of computing power at lower and lower cost enables molecular dynamics simulations to investigate the process of protein folding with an explicit treatment of water molecules. Such simulations are typically done with well known computational chemistry codes like CHARMM. Desktop grids such as the United Devices MetaProcessor are highly attractive platforms, since scavenging for unused machines on Intra- and Internet delivers compute power that is almost free. However, the predominant programming paradigm for current desktop grids is pure task parallelism and might not fit the needs for protein folding simulations with explicit water molecules. A short overall turn-around time of a simulation remains highly important for research productivity, but the need for an accurate model and long simulation time-scales leads to tasks that are too large for optimal scheduling on a desktop grid. To address this problem, we introduce a combination of task- and data parallelism as a well suitable computing paradigm for protein folding investigations on grid platforms. As a proof of concept, we design and implement a simple system for protein folding simulations based on the notion of combined task and data parallelism with clustered workers. Clustered workers are machines grouped into small clusters according to network and CPU performance criteria and act as super-nodes within a desktop grid, permitting the utilization of data parallelism in addition to the task parallelism. We integrate our new paradigm into the existing software environment of the United Devices MetaProcessor. For a test protein, we reach a better quality of the folding calculations than we reached using just task parallelism on distributed systems.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129852937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Programming for dependability in a service-based grid 基于服务的网格中的可靠性编程
Win Bausch, C. Pautasso, G. Alonso
Service-based Grid infrastructures emphasize service composition rather than sharing of low level resources. The idea is to build Grid applications out of computational services provided by the different sites of the Grid. Recent developments in the area of Web services have strengthened this idea by standardizing procedures like service description, publication and invocation. What is still missing is the infrastructure necessary to support the complete life cycle of applications running on service based Grids, i.e., suitable programming paradigms, execution infrastructure, and the ability to monitor and control such computations. Moreover, once computations are made of composable services, dependability becomes a key issue that needs to be addressed by the infrastructure as it cannot be addressed separately by each individual service. To address these concerns, we have developed the BioOpera Grid computing platform. BioOpera is a process support system for dependable cluster computing that has been extended with additional functionality to provide adequate support for service-based Grids. In this paper we describe how BioOpera can be used to develop, execute, and administer highly dependable computations over service-based Grids.
基于服务的网格基础设施强调服务组合,而不是低级资源的共享。其思想是利用网格的不同站点提供的计算服务来构建网格应用程序。Web服务领域的最新发展通过标准化服务描述、发布和调用等过程加强了这一思想。仍然缺少的是支持在基于服务的网格上运行的应用程序的完整生命周期所必需的基础设施,即合适的编程范例、执行基础设施以及监视和控制此类计算的能力。此外,一旦计算由可组合的服务组成,可靠性就成为需要由基础设施解决的关键问题,因为它不能由每个单独的服务单独解决。为了解决这些问题,我们开发了BioOpera网格计算平台。BioOpera是一个过程支持系统,用于可靠的集群计算,它已经扩展了额外的功能,为基于服务的网格提供足够的支持。在本文中,我们描述了如何使用BioOpera在基于服务的网格上开发、执行和管理高度可靠的计算。
{"title":"Programming for dependability in a service-based grid","authors":"Win Bausch, C. Pautasso, G. Alonso","doi":"10.1109/CCGRID.2003.1199365","DOIUrl":"https://doi.org/10.1109/CCGRID.2003.1199365","url":null,"abstract":"Service-based Grid infrastructures emphasize service composition rather than sharing of low level resources. The idea is to build Grid applications out of computational services provided by the different sites of the Grid. Recent developments in the area of Web services have strengthened this idea by standardizing procedures like service description, publication and invocation. What is still missing is the infrastructure necessary to support the complete life cycle of applications running on service based Grids, i.e., suitable programming paradigms, execution infrastructure, and the ability to monitor and control such computations. Moreover, once computations are made of composable services, dependability becomes a key issue that needs to be addressed by the infrastructure as it cannot be addressed separately by each individual service. To address these concerns, we have developed the BioOpera Grid computing platform. BioOpera is a process support system for dependable cluster computing that has been extended with additional functionality to provide adequate support for service-based Grids. In this paper we describe how BioOpera can be used to develop, execute, and administer highly dependable computations over service-based Grids.","PeriodicalId":433323,"journal":{"name":"CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128296582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
期刊
CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1