首页 > 最新文献

Proceedings. IEEE International Conference on Cluster Computing最新文献

英文 中文
Socket cloning for cluster-based web servers 基于集群的web服务器的套接字克隆
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137762
Y. Sit, Cho-Li Wang, F. Lau
Cluster-based web server is a popular solution to meet the demand of the ever-growing web traffic. However existing approaches suffer from several limitations to achieve this. Dispatcher-based systems either can achieve only coarse-grained load balancing or would introduce heavy load to the dispatcher Mechanisms like cooperative caching consume much network resources when transferring large cache objects. In this paper, we present a new network support mechanism, called Socket Cloning (SC), in which an opened socket can be migrated efficiently between cluster nodes. With SC, the processing of HTTP requests can be moved to the node that has a cached copy of the requested document, thus bypassing any object transfer between peer servers. A prototype has been implemented and tests show that SC incurs less overhead than all the mentioned approaches. In trace-driven benchmark tests, our system outperforms these approaches by more than 30% with a cluster of twelve web server nodes.
基于集群的web服务器是满足日益增长的web流量需求的一种流行的解决方案。然而,现有的方法在实现这一目标方面存在一些限制。基于调度程序的系统要么只能实现粗粒度的负载平衡,要么会给调度程序带来沉重的负载。在传输大型缓存对象时,协作缓存等机制会消耗大量网络资源。本文提出了一种新的网络支持机制,称为套接字克隆(Socket Cloning, SC),该机制可以使打开的套接字在集群节点之间高效迁移。使用SC,可以将HTTP请求的处理转移到具有所请求文档的缓存副本的节点,从而绕过对等服务器之间的任何对象传输。已经实现了一个原型,测试表明SC比所有上述方法产生的开销都要少。在跟踪驱动的基准测试中,我们的系统在12个web服务器节点的集群中比这些方法的性能高出30%以上。
{"title":"Socket cloning for cluster-based web servers","authors":"Y. Sit, Cho-Li Wang, F. Lau","doi":"10.1109/CLUSTR.2002.1137762","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137762","url":null,"abstract":"Cluster-based web server is a popular solution to meet the demand of the ever-growing web traffic. However existing approaches suffer from several limitations to achieve this. Dispatcher-based systems either can achieve only coarse-grained load balancing or would introduce heavy load to the dispatcher Mechanisms like cooperative caching consume much network resources when transferring large cache objects. In this paper, we present a new network support mechanism, called Socket Cloning (SC), in which an opened socket can be migrated efficiently between cluster nodes. With SC, the processing of HTTP requests can be moved to the node that has a cached copy of the requested document, thus bypassing any object transfer between peer servers. A prototype has been implemented and tests show that SC incurs less overhead than all the mentioned approaches. In trace-driven benchmark tests, our system outperforms these approaches by more than 30% with a cluster of twelve web server nodes.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85018853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Cplant/sup /spl trade// runtime system support for multi-processor and heterogeneous compute nodes 支持多处理器和异构计算节点的运行时系统
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137748
K. Pedretti, R. Brightwell, Joshua Williams
In this paper, we describe additions and modifications to the Computational Plant (Cplant/sup /spl trade//) system software to support multi-processor compute nodes and to support heterogeneous node types. We describe how these capabilities have been incorporated into our scalable runtime system and how these changes affect the interface seen by end users and application developers. We also discuss several important operating system and networking issues that can directly impact application performance. We present some initial performance metrics that indicate how our current implementation scales when multiple processes are running on a single node.
在本文中,我们描述了对Computational Plant (Plant /sup /spl trade//)系统软件的添加和修改,以支持多处理器计算节点和异构节点类型。我们描述了如何将这些功能整合到我们的可伸缩运行时系统中,以及这些更改如何影响最终用户和应用程序开发人员看到的界面。我们还讨论了几个可以直接影响应用程序性能的重要操作系统和网络问题。我们提供了一些初始性能指标,这些指标表明当多个进程在单个节点上运行时,我们当前的实现是如何扩展的。
{"title":"Cplant/sup /spl trade// runtime system support for multi-processor and heterogeneous compute nodes","authors":"K. Pedretti, R. Brightwell, Joshua Williams","doi":"10.1109/CLUSTR.2002.1137748","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137748","url":null,"abstract":"In this paper, we describe additions and modifications to the Computational Plant (Cplant/sup /spl trade//) system software to support multi-processor compute nodes and to support heterogeneous node types. We describe how these capabilities have been incorporated into our scalable runtime system and how these changes affect the interface seen by end users and application developers. We also discuss several important operating system and networking issues that can directly impact application performance. We present some initial performance metrics that indicate how our current implementation scales when multiple processes are running on a single node.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82950919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
CEJVM: "cluster enabled Java Virtual Machine" 启用集群的Java虚拟机
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137771
M. Janjua, M. M. Yasin, Ch. Falak Sher, K. Awan, I. Hassan
CEJVM is a cluster enabled Java Virtual Machine, which executes in a distributed fashion among collaborating nodes of a dedicated cluster. It extends Java's multithreading mechanism to the parallel computing paradigm by transparent migration of independent application threads modeled in master worker paradigm. The goal is to obtain improved performance for computationally-intensive multi-threaded Java programs without modifying traditional JVM code, Java language or compiler implementation. Deploying a master worker relationship among the nodes in the cluster, CEJVM monitors, packs, transports and resurrects the Java threads on "Master JVM" and helper "Worker JVMs". Currently, we have created a prototype that runs pure Java applications on the local Ethernet based cluster of Win2K computers. We have achieved appreciable speedup for an ANN training program written in Java (with independent threads) on CEJVM without any modification to the program's source code.
CEJVM是一个支持集群的Java虚拟机,它在专用集群的协作节点之间以分布式方式执行。它通过透明地迁移在主工作者范式中建模的独立应用程序线程,将Java的多线程机制扩展到并行计算范式。目标是在不修改传统JVM代码、Java语言或编译器实现的情况下,为计算密集型多线程Java程序获得改进的性能。在集群中的节点之间部署一个主worker关系,CEJVM监视、打包、传输和恢复“主JVM”和辅助“worker JVM”上的Java线程。目前,我们已经创建了一个原型,在基于本地以太网的Win2K计算机集群上运行纯Java应用程序。我们已经在CEJVM上实现了用Java(独立线程)编写的人工神经网络训练程序的显著加速,而无需对程序的源代码进行任何修改。
{"title":"CEJVM: \"cluster enabled Java Virtual Machine\"","authors":"M. Janjua, M. M. Yasin, Ch. Falak Sher, K. Awan, I. Hassan","doi":"10.1109/CLUSTR.2002.1137771","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137771","url":null,"abstract":"CEJVM is a cluster enabled Java Virtual Machine, which executes in a distributed fashion among collaborating nodes of a dedicated cluster. It extends Java's multithreading mechanism to the parallel computing paradigm by transparent migration of independent application threads modeled in master worker paradigm. The goal is to obtain improved performance for computationally-intensive multi-threaded Java programs without modifying traditional JVM code, Java language or compiler implementation. Deploying a master worker relationship among the nodes in the cluster, CEJVM monitors, packs, transports and resurrects the Java threads on \"Master JVM\" and helper \"Worker JVMs\". Currently, we have created a prototype that runs pure Java applications on the local Ethernet based cluster of Win2K computers. We have achieved appreciable speedup for an ANN training program written in Java (with independent threads) on CEJVM without any modification to the program's source code.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87738134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Supporting MPI collective communication on network processors 支持网络处理器上的MPI集体通信
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137731
Qianfeng Zhang, C. Keppitiyagama, Alan S. Wagner
We present work that extends our previous Myrinet port for LAM/MPI, MPI-NP, with collective communication primitives on the NIC. This work is another step in our experiment of making the NIC MPI aware. We believe that an MPI aware control program on the NIC can deliver a richer set of performance enhancements, not just restricted to better bandwidth/latency, to MPI applications. MPI collective communication involves considerable interactions between the communication subsystems of the nodes that are not of any direct interest to the application. By migrating these talkative components to the Myrinet network interface card we allow this dialog between the nodes to happen with minimum latency. We explore the advantage of supporting several MPI collective communication routines on the NIC. These include MPI /spl I.bar/Bcast (), MPI/spl I.bar/Barrier and MPI/spl I.bar/Comm/spl I.bar/Create ().
我们提出的工作扩展了我们以前的用于LAM/MPI, MPI- np的Myrinet端口,在NIC上具有集体通信原语。这项工作是我们使NIC MPI感知的实验的又一步。我们相信,NIC上的MPI感知控制程序可以为MPI应用程序提供更丰富的性能增强,而不仅仅局限于更好的带宽/延迟。MPI集体通信涉及节点通信子系统之间的大量交互,这些交互与应用程序没有任何直接关系。通过将这些对话组件迁移到Myrinet网络接口卡,我们允许节点之间的对话以最小的延迟发生。我们探讨了在NIC上支持多个MPI集体通信例程的优势。这些包括MPI/spl .bar/Bcast (), MPI/spl .bar/Barrier和MPI/spl .bar/Comm/spl .bar/Create()。
{"title":"Supporting MPI collective communication on network processors","authors":"Qianfeng Zhang, C. Keppitiyagama, Alan S. Wagner","doi":"10.1109/CLUSTR.2002.1137731","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137731","url":null,"abstract":"We present work that extends our previous Myrinet port for LAM/MPI, MPI-NP, with collective communication primitives on the NIC. This work is another step in our experiment of making the NIC MPI aware. We believe that an MPI aware control program on the NIC can deliver a richer set of performance enhancements, not just restricted to better bandwidth/latency, to MPI applications. MPI collective communication involves considerable interactions between the communication subsystems of the nodes that are not of any direct interest to the application. By migrating these talkative components to the Myrinet network interface card we allow this dialog between the nodes to happen with minimum latency. We explore the advantage of supporting several MPI collective communication routines on the NIC. These include MPI /spl I.bar/Bcast (), MPI/spl I.bar/Barrier and MPI/spl I.bar/Comm/spl I.bar/Create ().","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87070219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Design and validation of portable communication infrastructure for fault-tolerant cluster middleware 面向容错集群中间件的便携式通信基础设施的设计与验证
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137755
Ming Li, Wenchao Tao, Dani Goldberg, I. Hsu, Y. Tamir
We describe the communication infrastructure (CI) for our fault-tolerant cluster middleware, which is optimized for two classes of communication: for the applications and for the cluster management middleware. This CI was designed for portability and for efficient operation on top of modern user-level message passing mechanisms. We present a functional fault model for the CI and show how platform-specific faults map to this fault model. Based on this fault model, we have developed a fault injection scheme that is integrated with the CI and is thus portable across different communication technologies. We have used fault injection to validate and evaluate the implementation of the CI itself as well as the cluster management middleware in the presence of communication faults.
我们描述了容错集群中间件的通信基础设施(CI),它针对两类通信进行了优化:应用程序通信和集群管理中间件通信。该CI是为可移植性和在现代用户级消息传递机制之上的高效操作而设计的。我们为CI提供了一个功能故障模型,并展示了特定于平台的故障如何映射到该故障模型。基于这个故障模型,我们开发了一个与CI集成的故障注入方案,因此可以跨不同的通信技术进行移植。我们已经使用故障注入来验证和评估存在通信故障的CI本身以及集群管理中间件的实现。
{"title":"Design and validation of portable communication infrastructure for fault-tolerant cluster middleware","authors":"Ming Li, Wenchao Tao, Dani Goldberg, I. Hsu, Y. Tamir","doi":"10.1109/CLUSTR.2002.1137755","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137755","url":null,"abstract":"We describe the communication infrastructure (CI) for our fault-tolerant cluster middleware, which is optimized for two classes of communication: for the applications and for the cluster management middleware. This CI was designed for portability and for efficient operation on top of modern user-level message passing mechanisms. We present a functional fault model for the CI and show how platform-specific faults map to this fault model. Based on this fault model, we have developed a fault injection scheme that is integrated with the CI and is thus portable across different communication technologies. We have used fault injection to validate and evaluate the implementation of the CI itself as well as the cluster management middleware in the presence of communication faults.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76930907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Scalable cluster administration - Chiba City I approach and lessons learned 可伸缩的集群管理——千叶城市I的方法和经验教训
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137749
J. Navarro, R. Evard, Daniel Nurmi, N. Desai
Systems administrators of large clusters often need to perform the same administrative task hundreds or thousands of times. Administrators have traditionally performed some time-consuming tasks, such as operating system installation, configuration, and maintenance, manually. By combining network services such as DHCP, TFTP, FTP, HTTP, and NFS with remote hardware control and scripted installation, configuration, and maintenance techniques, cluster administrators can automate these administrative tasks. Scalable cluster administration addresses this challenge: What hardware and software design techniques can cluster builders use to automate cluster administration on very large clusters? We describe the approach used in the Mathematics and Computer Science Division of Argonne National Laboratory on Chiba City I, a 314-node Linux cluster; and we analyze the scalability, flexibility, performance and reliability benefits and limitations from that approach.
大型集群的系统管理员经常需要执行数百或数千次相同的管理任务。传统上,管理员手动执行一些耗时的任务,如操作系统安装、配置和维护。通过将网络服务(如DHCP、TFTP、FTP、HTTP和NFS)与远程硬件控制和脚本化安装、配置和维护技术相结合,集群管理员可以自动执行这些管理任务。可伸缩集群管理解决了这一挑战:集群构建器可以使用哪些硬件和软件设计技术来在非常大的集群上自动化集群管理?我们描述了阿贡国家实验室的数学和计算机科学部在千叶市I上使用的方法,一个314个节点的Linux集群;我们分析了这种方法的可伸缩性、灵活性、性能和可靠性的优点和局限性。
{"title":"Scalable cluster administration - Chiba City I approach and lessons learned","authors":"J. Navarro, R. Evard, Daniel Nurmi, N. Desai","doi":"10.1109/CLUSTR.2002.1137749","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137749","url":null,"abstract":"Systems administrators of large clusters often need to perform the same administrative task hundreds or thousands of times. Administrators have traditionally performed some time-consuming tasks, such as operating system installation, configuration, and maintenance, manually. By combining network services such as DHCP, TFTP, FTP, HTTP, and NFS with remote hardware control and scripted installation, configuration, and maintenance techniques, cluster administrators can automate these administrative tasks. Scalable cluster administration addresses this challenge: What hardware and software design techniques can cluster builders use to automate cluster administration on very large clusters? We describe the approach used in the Mathematics and Computer Science Division of Argonne National Laboratory on Chiba City I, a 314-node Linux cluster; and we analyze the scalability, flexibility, performance and reliability benefits and limitations from that approach.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76857516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Research directions in parallel I/O for clusters 集群并行I/O的研究方向
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137777
W. Ligon
Parallel I/O remains a critical problem for cluster computing. A significant number of important applications need high performance parallel I/O and most cluster systems provide enough hardware to deliver the required performance. System software for achieving the desired goals remains in the research and development stage. A number of parallel file systems have achieved remarkable goals in one or more of several key areas related to parallel I/O, but there is still great reluctance to commit to any file system currently available. This is mostly due to the fact that these file systems do not address enough issues at once in a package that is robust enough for widespread use. Critical goals in the development of an operation parallel file system for clusters include: high performance with scalability; reliability/fault tolerance; flexible and efficient integration with parallel codes; portability. These issues give rise to problems with interfaces and semantics, in addition to specific technical problems such as distributed locking, caching, and redundancy. The next generation of parallel file systems must look beyond traditional interfaces, semantics, and implementation methods in order achieve the desired goals. Of equal importance is the issue of knowing to what extent a given file system achieves these goals. Given that no file system is likely to address all of these goals equally well, it is important to be able to measure a given file system's utility in these areas through benchmarking or other evaluation methods. We explore a few of these issues and include specific examples and a case study of the PVFS V2 team's approach to these issues.
并行I/O仍然是集群计算的一个关键问题。大量重要的应用程序需要高性能并行I/O,大多数集群系统提供足够的硬件来提供所需的性能。实现预期目标的系统软件仍处于研发阶段。许多并行文件系统已经在与并行I/O相关的几个关键领域中的一个或多个领域实现了显著的目标,但是仍然非常不愿意提交到当前可用的任何文件系统。这主要是由于这样一个事实,即这些文件系统不能在一个足够健壮、可以广泛使用的包中一次解决足够多的问题。为集群开发操作并行文件系统的关键目标包括:高性能和可伸缩性;可靠性/容错;灵活高效地与并行代码集成;可移植性。除了分布式锁定、缓存和冗余等特定技术问题外,这些问题还会引起接口和语义方面的问题。为了达到预期的目标,下一代并行文件系统必须超越传统的接口、语义和实现方法。同样重要的是了解给定文件系统在多大程度上实现了这些目标。考虑到没有文件系统可能同样地满足所有这些目标,因此能够通过基准测试或其他评估方法度量给定文件系统在这些领域的效用是很重要的。我们将探讨其中的一些问题,并包括PVFS V2团队解决这些问题的具体示例和案例研究。
{"title":"Research directions in parallel I/O for clusters","authors":"W. Ligon","doi":"10.1109/CLUSTR.2002.1137777","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137777","url":null,"abstract":"Parallel I/O remains a critical problem for cluster computing. A significant number of important applications need high performance parallel I/O and most cluster systems provide enough hardware to deliver the required performance. System software for achieving the desired goals remains in the research and development stage. A number of parallel file systems have achieved remarkable goals in one or more of several key areas related to parallel I/O, but there is still great reluctance to commit to any file system currently available. This is mostly due to the fact that these file systems do not address enough issues at once in a package that is robust enough for widespread use. Critical goals in the development of an operation parallel file system for clusters include: high performance with scalability; reliability/fault tolerance; flexible and efficient integration with parallel codes; portability. These issues give rise to problems with interfaces and semantics, in addition to specific technical problems such as distributed locking, caching, and redundancy. The next generation of parallel file systems must look beyond traditional interfaces, semantics, and implementation methods in order achieve the desired goals. Of equal importance is the issue of knowing to what extent a given file system achieves these goals. Given that no file system is likely to address all of these goals equally well, it is important to be able to measure a given file system's utility in these areas through benchmarking or other evaluation methods. We explore a few of these issues and include specific examples and a case study of the PVFS V2 team's approach to these issues.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81702709","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Integrated admission and congestion control for QoS support in clusters 集成准入和拥塞控制,支持集群中的QoS
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137761
K. H. Yum, Eun Jung Kim, C. Das, Mazin S. Yousif, J. Duato
Admission and congestion control mechanisms are integral parts of any Quality of Service (QoS) design for networks that support integrated traffic. In this paper we propose an-admission control algorithm and a congestion control algorithm for clusters, which are increasingly being used in a diverse set of applications that require QoS guarantees. The uniqueness of our approach is that we develop these algorithms for wormhole-switched networks. We use QoS-capable wormhole routers and QoS-capable network interface cards (NICs), referred to as Host Channel Adapters (HCAs) in InfiniBand/spl trade/ Architecture (IBA), to evaluate the effectiveness of these algorithms. The admission control is applied at the HCAs and the routers, while the congestion control is deployed only at the HCAs. Simulation results indicate that the admission and congestion control algorithms are quite effective in delivering the assured performance. The proposed credit-based congestion control algorithm is simple and practical in that it relies on hardware already available in the HCA to regulate traffic injection.
接纳和拥塞控制机制是支持集成流量的网络的任何服务质量(QoS)设计的组成部分。在本文中,我们提出了一种用于集群的接纳控制算法和拥塞控制算法,这两种算法越来越多地用于需要QoS保证的各种应用中。我们方法的独特之处在于我们为虫洞交换网络开发了这些算法。我们使用支持qos的虫洞路由器和支持qos的网络接口卡(nic),在InfiniBand/spl贸易/架构(IBA)中称为主机通道适配器(hca),来评估这些算法的有效性。接纳控制作用于hca和路由器,拥塞控制作用于hca。仿真结果表明,接纳和拥塞控制算法在保证性能方面是非常有效的。提出的基于信用的拥塞控制算法简单实用,它依赖于HCA中已有的硬件来调节流量注入。
{"title":"Integrated admission and congestion control for QoS support in clusters","authors":"K. H. Yum, Eun Jung Kim, C. Das, Mazin S. Yousif, J. Duato","doi":"10.1109/CLUSTR.2002.1137761","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137761","url":null,"abstract":"Admission and congestion control mechanisms are integral parts of any Quality of Service (QoS) design for networks that support integrated traffic. In this paper we propose an-admission control algorithm and a congestion control algorithm for clusters, which are increasingly being used in a diverse set of applications that require QoS guarantees. The uniqueness of our approach is that we develop these algorithms for wormhole-switched networks. We use QoS-capable wormhole routers and QoS-capable network interface cards (NICs), referred to as Host Channel Adapters (HCAs) in InfiniBand/spl trade/ Architecture (IBA), to evaluate the effectiveness of these algorithms. The admission control is applied at the HCAs and the routers, while the congestion control is deployed only at the HCAs. Simulation results indicate that the admission and congestion control algorithms are quite effective in delivering the assured performance. The proposed credit-based congestion control algorithm is simple and practical in that it relies on hardware already available in the HCA to regulate traffic injection.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85309481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Protocol-dependent message-passing performance on Linux clusters Linux集群上依赖于协议的消息传递性能
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137746
D. Turner, Xuehua Chen
In a Linux cluster, as in any multiprocessor system, the inter-processor communication rate is the major limiting factor to its general usefulness. This research is geared toward improving the communication performance by identifying where the inefficiencies lie and trying to understand their cause. The NetPIPE utility is being used to compare the latency and throughput of all current message-passing libraries and the native software layers they run upon for a variety of hardware configurations.
在Linux集群中,就像在任何多处理器系统中一样,处理器间通信速率是限制其一般用途的主要因素。这项研究旨在通过确定低效的地方并试图了解其原因来提高通信性能。NetPIPE实用程序用于比较所有当前消息传递库的延迟和吞吐量,以及它们在各种硬件配置下运行的本机软件层。
{"title":"Protocol-dependent message-passing performance on Linux clusters","authors":"D. Turner, Xuehua Chen","doi":"10.1109/CLUSTR.2002.1137746","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137746","url":null,"abstract":"In a Linux cluster, as in any multiprocessor system, the inter-processor communication rate is the major limiting factor to its general usefulness. This research is geared toward improving the communication performance by identifying where the inefficiencies lie and trying to understand their cause. The NetPIPE utility is being used to compare the latency and throughput of all current message-passing libraries and the native software layers they run upon for a variety of hardware configurations.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85481978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 54
A new architecture for secure carrier-class clusters 安全的电信级集群的新架构
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137790
M. Pourzandi, I. Haddad, C. Levert, Miroslaw Zakrzewski, M. Dagenais
Traditionally the telecom industry has used clusters to meet its carrier-class requirements of high availability, reliability, and scalability, while relying on cost-effective hardware and software. Efficient cluster security is now an essential requirement and has not yet been addressed in a coherent fashion on clustered systems. This paper presents an approach for distributed security architecture that supports advanced security mechanisms for current and future security needs, targeted for carrier-class application servers running on clustered systems.
传统上,电信行业一直使用集群来满足其电信级的高可用性、可靠性和可伸缩性需求,同时依赖于经济高效的硬件和软件。高效的集群安全性现在是一项基本需求,但在集群系统上还没有以一致的方式得到解决。本文提出了一种分布式安全架构的方法,它支持当前和未来安全需求的高级安全机制,针对运行在集群系统上的运营商级应用服务器。
{"title":"A new architecture for secure carrier-class clusters","authors":"M. Pourzandi, I. Haddad, C. Levert, Miroslaw Zakrzewski, M. Dagenais","doi":"10.1109/CLUSTR.2002.1137790","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137790","url":null,"abstract":"Traditionally the telecom industry has used clusters to meet its carrier-class requirements of high availability, reliability, and scalability, while relying on cost-effective hardware and software. Efficient cluster security is now an essential requirement and has not yet been addressed in a coherent fashion on clustered systems. This paper presents an approach for distributed security architecture that supports advanced security mechanisms for current and future security needs, targeted for carrier-class application servers running on clustered systems.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83879695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
期刊
Proceedings. IEEE International Conference on Cluster Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1