首页 > 最新文献

Proceedings. IEEE International Conference on Cluster Computing最新文献

英文 中文
Socket cloning for cluster-based web servers 基于集群的web服务器的套接字克隆
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137762
Y. Sit, Cho-Li Wang, F. Lau
Cluster-based web server is a popular solution to meet the demand of the ever-growing web traffic. However existing approaches suffer from several limitations to achieve this. Dispatcher-based systems either can achieve only coarse-grained load balancing or would introduce heavy load to the dispatcher Mechanisms like cooperative caching consume much network resources when transferring large cache objects. In this paper, we present a new network support mechanism, called Socket Cloning (SC), in which an opened socket can be migrated efficiently between cluster nodes. With SC, the processing of HTTP requests can be moved to the node that has a cached copy of the requested document, thus bypassing any object transfer between peer servers. A prototype has been implemented and tests show that SC incurs less overhead than all the mentioned approaches. In trace-driven benchmark tests, our system outperforms these approaches by more than 30% with a cluster of twelve web server nodes.
基于集群的web服务器是满足日益增长的web流量需求的一种流行的解决方案。然而,现有的方法在实现这一目标方面存在一些限制。基于调度程序的系统要么只能实现粗粒度的负载平衡,要么会给调度程序带来沉重的负载。在传输大型缓存对象时,协作缓存等机制会消耗大量网络资源。本文提出了一种新的网络支持机制,称为套接字克隆(Socket Cloning, SC),该机制可以使打开的套接字在集群节点之间高效迁移。使用SC,可以将HTTP请求的处理转移到具有所请求文档的缓存副本的节点,从而绕过对等服务器之间的任何对象传输。已经实现了一个原型,测试表明SC比所有上述方法产生的开销都要少。在跟踪驱动的基准测试中,我们的系统在12个web服务器节点的集群中比这些方法的性能高出30%以上。
{"title":"Socket cloning for cluster-based web servers","authors":"Y. Sit, Cho-Li Wang, F. Lau","doi":"10.1109/CLUSTR.2002.1137762","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137762","url":null,"abstract":"Cluster-based web server is a popular solution to meet the demand of the ever-growing web traffic. However existing approaches suffer from several limitations to achieve this. Dispatcher-based systems either can achieve only coarse-grained load balancing or would introduce heavy load to the dispatcher Mechanisms like cooperative caching consume much network resources when transferring large cache objects. In this paper, we present a new network support mechanism, called Socket Cloning (SC), in which an opened socket can be migrated efficiently between cluster nodes. With SC, the processing of HTTP requests can be moved to the node that has a cached copy of the requested document, thus bypassing any object transfer between peer servers. A prototype has been implemented and tests show that SC incurs less overhead than all the mentioned approaches. In trace-driven benchmark tests, our system outperforms these approaches by more than 30% with a cluster of twelve web server nodes.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":"14 1","pages":"333-340"},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85018853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Scalable cluster administration - Chiba City I approach and lessons learned 可伸缩的集群管理——千叶城市I的方法和经验教训
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137749
J. Navarro, R. Evard, Daniel Nurmi, N. Desai
Systems administrators of large clusters often need to perform the same administrative task hundreds or thousands of times. Administrators have traditionally performed some time-consuming tasks, such as operating system installation, configuration, and maintenance, manually. By combining network services such as DHCP, TFTP, FTP, HTTP, and NFS with remote hardware control and scripted installation, configuration, and maintenance techniques, cluster administrators can automate these administrative tasks. Scalable cluster administration addresses this challenge: What hardware and software design techniques can cluster builders use to automate cluster administration on very large clusters? We describe the approach used in the Mathematics and Computer Science Division of Argonne National Laboratory on Chiba City I, a 314-node Linux cluster; and we analyze the scalability, flexibility, performance and reliability benefits and limitations from that approach.
大型集群的系统管理员经常需要执行数百或数千次相同的管理任务。传统上,管理员手动执行一些耗时的任务,如操作系统安装、配置和维护。通过将网络服务(如DHCP、TFTP、FTP、HTTP和NFS)与远程硬件控制和脚本化安装、配置和维护技术相结合,集群管理员可以自动执行这些管理任务。可伸缩集群管理解决了这一挑战:集群构建器可以使用哪些硬件和软件设计技术来在非常大的集群上自动化集群管理?我们描述了阿贡国家实验室的数学和计算机科学部在千叶市I上使用的方法,一个314个节点的Linux集群;我们分析了这种方法的可伸缩性、灵活性、性能和可靠性的优点和局限性。
{"title":"Scalable cluster administration - Chiba City I approach and lessons learned","authors":"J. Navarro, R. Evard, Daniel Nurmi, N. Desai","doi":"10.1109/CLUSTR.2002.1137749","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137749","url":null,"abstract":"Systems administrators of large clusters often need to perform the same administrative task hundreds or thousands of times. Administrators have traditionally performed some time-consuming tasks, such as operating system installation, configuration, and maintenance, manually. By combining network services such as DHCP, TFTP, FTP, HTTP, and NFS with remote hardware control and scripted installation, configuration, and maintenance techniques, cluster administrators can automate these administrative tasks. Scalable cluster administration addresses this challenge: What hardware and software design techniques can cluster builders use to automate cluster administration on very large clusters? We describe the approach used in the Mathematics and Computer Science Division of Argonne National Laboratory on Chiba City I, a 314-node Linux cluster; and we analyze the scalability, flexibility, performance and reliability benefits and limitations from that approach.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":"95 1","pages":"215-221"},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76857516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Design and validation of portable communication infrastructure for fault-tolerant cluster middleware 面向容错集群中间件的便携式通信基础设施的设计与验证
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137755
Ming Li, Wenchao Tao, Dani Goldberg, I. Hsu, Y. Tamir
We describe the communication infrastructure (CI) for our fault-tolerant cluster middleware, which is optimized for two classes of communication: for the applications and for the cluster management middleware. This CI was designed for portability and for efficient operation on top of modern user-level message passing mechanisms. We present a functional fault model for the CI and show how platform-specific faults map to this fault model. Based on this fault model, we have developed a fault injection scheme that is integrated with the CI and is thus portable across different communication technologies. We have used fault injection to validate and evaluate the implementation of the CI itself as well as the cluster management middleware in the presence of communication faults.
我们描述了容错集群中间件的通信基础设施(CI),它针对两类通信进行了优化:应用程序通信和集群管理中间件通信。该CI是为可移植性和在现代用户级消息传递机制之上的高效操作而设计的。我们为CI提供了一个功能故障模型,并展示了特定于平台的故障如何映射到该故障模型。基于这个故障模型,我们开发了一个与CI集成的故障注入方案,因此可以跨不同的通信技术进行移植。我们已经使用故障注入来验证和评估存在通信故障的CI本身以及集群管理中间件的实现。
{"title":"Design and validation of portable communication infrastructure for fault-tolerant cluster middleware","authors":"Ming Li, Wenchao Tao, Dani Goldberg, I. Hsu, Y. Tamir","doi":"10.1109/CLUSTR.2002.1137755","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137755","url":null,"abstract":"We describe the communication infrastructure (CI) for our fault-tolerant cluster middleware, which is optimized for two classes of communication: for the applications and for the cluster management middleware. This CI was designed for portability and for efficient operation on top of modern user-level message passing mechanisms. We present a functional fault model for the CI and show how platform-specific faults map to this fault model. Based on this fault model, we have developed a fault injection scheme that is integrated with the CI and is thus portable across different communication technologies. We have used fault injection to validate and evaluate the implementation of the CI itself as well as the cluster management middleware in the presence of communication faults.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":"191 1","pages":"266-274"},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"76930907","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Cplant/sup /spl trade// runtime system support for multi-processor and heterogeneous compute nodes 支持多处理器和异构计算节点的运行时系统
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137748
K. Pedretti, R. Brightwell, Joshua Williams
In this paper, we describe additions and modifications to the Computational Plant (Cplant/sup /spl trade//) system software to support multi-processor compute nodes and to support heterogeneous node types. We describe how these capabilities have been incorporated into our scalable runtime system and how these changes affect the interface seen by end users and application developers. We also discuss several important operating system and networking issues that can directly impact application performance. We present some initial performance metrics that indicate how our current implementation scales when multiple processes are running on a single node.
在本文中,我们描述了对Computational Plant (Plant /sup /spl trade//)系统软件的添加和修改,以支持多处理器计算节点和异构节点类型。我们描述了如何将这些功能整合到我们的可伸缩运行时系统中,以及这些更改如何影响最终用户和应用程序开发人员看到的界面。我们还讨论了几个可以直接影响应用程序性能的重要操作系统和网络问题。我们提供了一些初始性能指标,这些指标表明当多个进程在单个节点上运行时,我们当前的实现是如何扩展的。
{"title":"Cplant/sup /spl trade// runtime system support for multi-processor and heterogeneous compute nodes","authors":"K. Pedretti, R. Brightwell, Joshua Williams","doi":"10.1109/CLUSTR.2002.1137748","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137748","url":null,"abstract":"In this paper, we describe additions and modifications to the Computational Plant (Cplant/sup /spl trade//) system software to support multi-processor compute nodes and to support heterogeneous node types. We describe how these capabilities have been incorporated into our scalable runtime system and how these changes affect the interface seen by end users and application developers. We also discuss several important operating system and networking issues that can directly impact application performance. We present some initial performance metrics that indicate how our current implementation scales when multiple processes are running on a single node.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":"39 1","pages":"207-214"},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82950919","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Supporting MPI collective communication on network processors 支持网络处理器上的MPI集体通信
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137731
Qianfeng Zhang, C. Keppitiyagama, Alan S. Wagner
We present work that extends our previous Myrinet port for LAM/MPI, MPI-NP, with collective communication primitives on the NIC. This work is another step in our experiment of making the NIC MPI aware. We believe that an MPI aware control program on the NIC can deliver a richer set of performance enhancements, not just restricted to better bandwidth/latency, to MPI applications. MPI collective communication involves considerable interactions between the communication subsystems of the nodes that are not of any direct interest to the application. By migrating these talkative components to the Myrinet network interface card we allow this dialog between the nodes to happen with minimum latency. We explore the advantage of supporting several MPI collective communication routines on the NIC. These include MPI /spl I.bar/Bcast (), MPI/spl I.bar/Barrier and MPI/spl I.bar/Comm/spl I.bar/Create ().
我们提出的工作扩展了我们以前的用于LAM/MPI, MPI- np的Myrinet端口,在NIC上具有集体通信原语。这项工作是我们使NIC MPI感知的实验的又一步。我们相信,NIC上的MPI感知控制程序可以为MPI应用程序提供更丰富的性能增强,而不仅仅局限于更好的带宽/延迟。MPI集体通信涉及节点通信子系统之间的大量交互,这些交互与应用程序没有任何直接关系。通过将这些对话组件迁移到Myrinet网络接口卡,我们允许节点之间的对话以最小的延迟发生。我们探讨了在NIC上支持多个MPI集体通信例程的优势。这些包括MPI/spl .bar/Bcast (), MPI/spl .bar/Barrier和MPI/spl .bar/Comm/spl .bar/Create()。
{"title":"Supporting MPI collective communication on network processors","authors":"Qianfeng Zhang, C. Keppitiyagama, Alan S. Wagner","doi":"10.1109/CLUSTR.2002.1137731","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137731","url":null,"abstract":"We present work that extends our previous Myrinet port for LAM/MPI, MPI-NP, with collective communication primitives on the NIC. This work is another step in our experiment of making the NIC MPI aware. We believe that an MPI aware control program on the NIC can deliver a richer set of performance enhancements, not just restricted to better bandwidth/latency, to MPI applications. MPI collective communication involves considerable interactions between the communication subsystems of the nodes that are not of any direct interest to the application. By migrating these talkative components to the Myrinet network interface card we allow this dialog between the nodes to happen with minimum latency. We explore the advantage of supporting several MPI collective communication routines on the NIC. These include MPI /spl I.bar/Bcast (), MPI/spl I.bar/Barrier and MPI/spl I.bar/Comm/spl I.bar/Create ().","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":"16 1","pages":"75-82"},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87070219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
CEJVM: "cluster enabled Java Virtual Machine" 启用集群的Java虚拟机
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137771
M. Janjua, M. M. Yasin, Ch. Falak Sher, K. Awan, I. Hassan
CEJVM is a cluster enabled Java Virtual Machine, which executes in a distributed fashion among collaborating nodes of a dedicated cluster. It extends Java's multithreading mechanism to the parallel computing paradigm by transparent migration of independent application threads modeled in master worker paradigm. The goal is to obtain improved performance for computationally-intensive multi-threaded Java programs without modifying traditional JVM code, Java language or compiler implementation. Deploying a master worker relationship among the nodes in the cluster, CEJVM monitors, packs, transports and resurrects the Java threads on "Master JVM" and helper "Worker JVMs". Currently, we have created a prototype that runs pure Java applications on the local Ethernet based cluster of Win2K computers. We have achieved appreciable speedup for an ANN training program written in Java (with independent threads) on CEJVM without any modification to the program's source code.
CEJVM是一个支持集群的Java虚拟机,它在专用集群的协作节点之间以分布式方式执行。它通过透明地迁移在主工作者范式中建模的独立应用程序线程,将Java的多线程机制扩展到并行计算范式。目标是在不修改传统JVM代码、Java语言或编译器实现的情况下,为计算密集型多线程Java程序获得改进的性能。在集群中的节点之间部署一个主worker关系,CEJVM监视、打包、传输和恢复“主JVM”和辅助“worker JVM”上的Java线程。目前,我们已经创建了一个原型,在基于本地以太网的Win2K计算机集群上运行纯Java应用程序。我们已经在CEJVM上实现了用Java(独立线程)编写的人工神经网络训练程序的显著加速,而无需对程序的源代码进行任何修改。
{"title":"CEJVM: \"cluster enabled Java Virtual Machine\"","authors":"M. Janjua, M. M. Yasin, Ch. Falak Sher, K. Awan, I. Hassan","doi":"10.1109/CLUSTR.2002.1137771","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137771","url":null,"abstract":"CEJVM is a cluster enabled Java Virtual Machine, which executes in a distributed fashion among collaborating nodes of a dedicated cluster. It extends Java's multithreading mechanism to the parallel computing paradigm by transparent migration of independent application threads modeled in master worker paradigm. The goal is to obtain improved performance for computationally-intensive multi-threaded Java programs without modifying traditional JVM code, Java language or compiler implementation. Deploying a master worker relationship among the nodes in the cluster, CEJVM monitors, packs, transports and resurrects the Java threads on \"Master JVM\" and helper \"Worker JVMs\". Currently, we have created a prototype that runs pure Java applications on the local Ethernet based cluster of Win2K computers. We have achieved appreciable speedup for an ANN training program written in Java (with independent threads) on CEJVM without any modification to the program's source code.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":"62 3 1","pages":"389-393"},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87738134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Efficient barrier using remote memory operations on VIA-based clusters 在基于via的集群上使用远程内存操作的高效屏障
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137732
Rinku Gupta, V. Tipparaju, J. Nieplocha, D. Panda
Most high performance scientific applications require efficient support for collective communication. Point-to-point message-passing communication in current generation clusters are based on the Send/Recv communication model. Collective communication operations built on top of such point-to-point message-passing operations might achieve suboptimal performance. VIA and the emerging InfiniBand architecture support remote DMA operations, which allow data to be moved between the nodes with low overhead; they also allow to create and provide a logical shared memory address space across the nodes. In this paper we focus on barrier, a frequently-used collective operations. We demonstrate how RDMA write operations can be used to support an inter-node barrier in a cluster with SMP nodes. Combining this with a scheme to exploit shared memory within a SMP node, we develop a fast barrier algorithm for a cluster of SMP nodes with a cLAN VIA interconnect. Compared to current barrier algorithms using the Send/Recv communication model, the new approach is shown to reduce barrier latency on a 64 processor (32 dual nodes) system by up to 66%. These results demonstrate that high performance and scalable barrier implementations can be delivered on current and next generation VIA/Infiniband-based clusters with RDMA support.
大多数高性能科学应用都需要有效的集体通信支持。当前代集群中的点对点消息传递通信基于Send/Recv通信模型。建立在这种点对点消息传递操作之上的集体通信操作可能会实现次优性能。VIA和新兴的InfiniBand架构支持远程DMA操作,允许数据以低开销在节点之间移动;它们还允许在节点间创建和提供逻辑共享内存地址空间。本文主要研究了一种常用的集体操作——屏障。我们将演示如何使用RDMA写操作来支持具有SMP节点的集群中的节点间屏障。结合利用SMP节点内共享内存的方案,我们为具有cLAN VIA互连的SMP节点集群开发了一种快速屏障算法。与目前使用Send/Recv通信模型的屏障算法相比,新方法可以将64处理器(32个双节点)系统上的屏障延迟减少66%。这些结果表明,高性能和可扩展的屏障实现可以在支持RDMA的当前和下一代基于VIA/ infiniband的集群上实现。
{"title":"Efficient barrier using remote memory operations on VIA-based clusters","authors":"Rinku Gupta, V. Tipparaju, J. Nieplocha, D. Panda","doi":"10.1109/CLUSTR.2002.1137732","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137732","url":null,"abstract":"Most high performance scientific applications require efficient support for collective communication. Point-to-point message-passing communication in current generation clusters are based on the Send/Recv communication model. Collective communication operations built on top of such point-to-point message-passing operations might achieve suboptimal performance. VIA and the emerging InfiniBand architecture support remote DMA operations, which allow data to be moved between the nodes with low overhead; they also allow to create and provide a logical shared memory address space across the nodes. In this paper we focus on barrier, a frequently-used collective operations. We demonstrate how RDMA write operations can be used to support an inter-node barrier in a cluster with SMP nodes. Combining this with a scheme to exploit shared memory within a SMP node, we develop a fast barrier algorithm for a cluster of SMP nodes with a cLAN VIA interconnect. Compared to current barrier algorithms using the Send/Recv communication model, the new approach is shown to reduce barrier latency on a 64 processor (32 dual nodes) system by up to 66%. These results demonstrate that high performance and scalable barrier implementations can be delivered on current and next generation VIA/Infiniband-based clusters with RDMA support.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":"131 1","pages":"83-90"},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74901901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
MyVIA: a design and implementation of the high performance Virtual Interface Architecture MyVIA:一个高性能虚拟接口架构的设计和实现
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137741
Yu Chen, Xiaoge Wang, Z. Jiao, Jun Xie, Zhihui Du, Sanli Li
Virtual Interface Architecture (VIA) established a communication model with low latency and high bandwidth, and defined the standard of user-level high-performance communication specification in cluster systems. This paper analyzes the current development, principle and implementations of VIA, and presents user-level high-performance communication software, MyVIA, based on Myrinet, which is comfortable with VIA specification. The paper first describes the design principle and framework of MyVIA, then proposes new technologies of MyVIA including User TLB, continued host physical memory and varied NIC buffer, the pipelining communication based on resource and DMA chain, and physical descriptor ring. Experimental results of performance comparisons and analysis are presented; the one-way bandwidth of MyVIA for a 4 KB message is 250 MB/s, and the lowest one-way latency is 8.46 /spl mu/s, which shows that the performance of MyVIA surpassed that of other implementations of VIA.
虚拟接口体系结构(VIA)建立了低延迟、高带宽的通信模型,定义了集群系统中用户级高性能通信规范的标准。本文分析了VIA的发展现状、原理和实现,提出了一种符合VIA规范的基于Myrinet的用户级高性能通信软件MyVIA。本文首先介绍了MyVIA的设计原理和框架,然后提出了MyVIA的新技术,包括用户TLB、主机连续物理内存和可变网卡缓冲区、基于资源链和DMA链的流水线通信以及物理描述符环。给出了性能比较和分析的实验结果;对于4 KB的消息,MyVIA的单向带宽为250 MB/s,最低的单向延迟为8.46 /spl mu/s,这表明MyVIA的性能优于其他VIA实现。
{"title":"MyVIA: a design and implementation of the high performance Virtual Interface Architecture","authors":"Yu Chen, Xiaoge Wang, Z. Jiao, Jun Xie, Zhihui Du, Sanli Li","doi":"10.1109/CLUSTR.2002.1137741","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137741","url":null,"abstract":"Virtual Interface Architecture (VIA) established a communication model with low latency and high bandwidth, and defined the standard of user-level high-performance communication specification in cluster systems. This paper analyzes the current development, principle and implementations of VIA, and presents user-level high-performance communication software, MyVIA, based on Myrinet, which is comfortable with VIA specification. The paper first describes the design principle and framework of MyVIA, then proposes new technologies of MyVIA including User TLB, continued host physical memory and varied NIC buffer, the pipelining communication based on resource and DMA chain, and physical descriptor ring. Experimental results of performance comparisons and analysis are presented; the one-way bandwidth of MyVIA for a 4 KB message is 250 MB/s, and the lowest one-way latency is 8.46 /spl mu/s, which shows that the performance of MyVIA surpassed that of other implementations of VIA.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":"511 1","pages":"160-167"},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83789813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A new architecture for secure carrier-class clusters 安全的电信级集群的新架构
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137790
M. Pourzandi, I. Haddad, C. Levert, Miroslaw Zakrzewski, M. Dagenais
Traditionally the telecom industry has used clusters to meet its carrier-class requirements of high availability, reliability, and scalability, while relying on cost-effective hardware and software. Efficient cluster security is now an essential requirement and has not yet been addressed in a coherent fashion on clustered systems. This paper presents an approach for distributed security architecture that supports advanced security mechanisms for current and future security needs, targeted for carrier-class application servers running on clustered systems.
传统上,电信行业一直使用集群来满足其电信级的高可用性、可靠性和可伸缩性需求,同时依赖于经济高效的硬件和软件。高效的集群安全性现在是一项基本需求,但在集群系统上还没有以一致的方式得到解决。本文提出了一种分布式安全架构的方法,它支持当前和未来安全需求的高级安全机制,针对运行在集群系统上的运营商级应用服务器。
{"title":"A new architecture for secure carrier-class clusters","authors":"M. Pourzandi, I. Haddad, C. Levert, Miroslaw Zakrzewski, M. Dagenais","doi":"10.1109/CLUSTR.2002.1137790","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137790","url":null,"abstract":"Traditionally the telecom industry has used clusters to meet its carrier-class requirements of high availability, reliability, and scalability, while relying on cost-effective hardware and software. Efficient cluster security is now an essential requirement and has not yet been addressed in a coherent fashion on clustered systems. This paper presents an approach for distributed security architecture that supports advanced security mechanisms for current and future security needs, targeted for carrier-class application servers running on clustered systems.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":"31 1","pages":"494-497"},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83879695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Memory mapped networks: a new deal for distributed shared memories ? the SciFS experience 内存映射网络:分布式共享内存的新协议?SciFS的体验
Pub Date : 2002-09-23 DOI: 10.1109/CLUSTR.2002.1137751
E. Cecchet
Distributed Shared Memories (DSM) performance has always suffered from high network latencies and software communication layers with a large overhead. Memory mapped networks such as Scalable Coherent Interface (SCI) allow to reliably access remote memory without involving the operating system. To show how DSM systems can benefit from this technology, we have developed SciFS, a DSM tightly integrated with the operating system, that exploits the high performance and the remote memory access capabilities of SCI. We first show the respective advantages of two communications techniques with SCI: programmed IO (PIO) and remote DMA (RDMA). Then, we describe how to build a scalable page transfer mechanism by mixing PIO and RDMA. Despite the lack of a broadcast mechanism with SCI, we demonstrate that it is possible to build scalable synchronization primitives using PIO. Finally, we evaluate various consistency models with scientific computing applications from the Splash benchmark. We observe that, even if the rough network performance is good, it is not sufficient to obtain acceptable results with applications that require fine grain parallelism. However, we show that memory mapped networks provide an efficient hardware support to implement software DSM systems without requiring complex relaxed consistency models. This way, DSM design can be greatly simplified using this technology.
分布式共享内存(DSM)性能一直受到高网络延迟和软件通信层的影响,并且开销很大。内存映射网络,如可扩展连贯接口(SCI),允许在不涉及操作系统的情况下可靠地访问远程内存。为了展示DSM系统如何从这项技术中受益,我们开发了SciFS,这是一个与操作系统紧密集成的DSM,利用了SCI的高性能和远程内存访问能力。我们首先展示了SCI两种通信技术的各自优势:编程IO (PIO)和远程DMA (RDMA)。然后,我们描述了如何通过混合PIO和RDMA来构建可扩展的页面传输机制。尽管SCI缺乏广播机制,但我们证明了可以使用PIO构建可扩展的同步原语。最后,我们用来自Splash基准的科学计算应用程序评估了各种一致性模型。我们观察到,即使粗网络性能很好,对于需要细粒度并行性的应用程序,也不足以获得可接受的结果。然而,我们表明,内存映射网络为实现软件DSM系统提供了有效的硬件支持,而不需要复杂的宽松一致性模型。这样,使用该技术可以大大简化DSM设计。
{"title":"Memory mapped networks: a new deal for distributed shared memories ? the SciFS experience","authors":"E. Cecchet","doi":"10.1109/CLUSTR.2002.1137751","DOIUrl":"https://doi.org/10.1109/CLUSTR.2002.1137751","url":null,"abstract":"Distributed Shared Memories (DSM) performance has always suffered from high network latencies and software communication layers with a large overhead. Memory mapped networks such as Scalable Coherent Interface (SCI) allow to reliably access remote memory without involving the operating system. To show how DSM systems can benefit from this technology, we have developed SciFS, a DSM tightly integrated with the operating system, that exploits the high performance and the remote memory access capabilities of SCI. We first show the respective advantages of two communications techniques with SCI: programmed IO (PIO) and remote DMA (RDMA). Then, we describe how to build a scalable page transfer mechanism by mixing PIO and RDMA. Despite the lack of a broadcast mechanism with SCI, we demonstrate that it is possible to build scalable synchronization primitives using PIO. Finally, we evaluate various consistency models with scientific computing applications from the Splash benchmark. We observe that, even if the rough network performance is good, it is not sufficient to obtain acceptable results with applications that require fine grain parallelism. However, we show that memory mapped networks provide an efficient hardware support to implement software DSM systems without requiring complex relaxed consistency models. This way, DSM design can be greatly simplified using this technology.","PeriodicalId":92128,"journal":{"name":"Proceedings. IEEE International Conference on Cluster Computing","volume":"97 1","pages":"231-238"},"PeriodicalIF":0.0,"publicationDate":"2002-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85266469","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings. IEEE International Conference on Cluster Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1