首页 > 最新文献

Proceedings the Ninth International Symposium on High-Performance Distributed Computing最新文献

英文 中文
Prophesy: an infrastructure for analyzing and modeling the performance of parallel and distributed applications 预言:用于分析和建模并行和分布式应用程序性能的基础设施
Xingfu Wu, V. Taylor, J. Geisler, X. Li, Z. Lan, R. Stevens, M. Hereld, I. Judson
Efficient execution of applications requires insight into how the system features impact the performance of the application. For distributed systems, the task of gaining this insight is complicated by the complexity of the system features. This insight generally results from significant experimental analysis and possibly the development of performance models. This paper presents the Prophesy project, an infrastructure that aids in gaining this needed insight based upon experience. The core component of Prophesy is a relational database that allows for the recording of performance data, system features and application details.
有效地执行应用程序需要深入了解系统特性如何影响应用程序的性能。对于分布式系统,获得这种洞察力的任务由于系统特性的复杂性而变得复杂。这种见解通常来自重要的实验分析和可能的性能模型的开发。本文介绍了预言项目,这是一个基础设施,可以帮助您根据经验获得所需的洞察力。预言的核心组件是一个关系数据库,它允许记录性能数据、系统特性和应用程序细节。
{"title":"Prophesy: an infrastructure for analyzing and modeling the performance of parallel and distributed applications","authors":"Xingfu Wu, V. Taylor, J. Geisler, X. Li, Z. Lan, R. Stevens, M. Hereld, I. Judson","doi":"10.1109/HPDC.2000.868668","DOIUrl":"https://doi.org/10.1109/HPDC.2000.868668","url":null,"abstract":"Efficient execution of applications requires insight into how the system features impact the performance of the application. For distributed systems, the task of gaining this insight is complicated by the complexity of the system features. This insight generally results from significant experimental analysis and possibly the development of performance models. This paper presents the Prophesy project, an infrastructure that aids in gaining this needed insight based upon experience. The core component of Prophesy is a relational database that allows for the recording of performance data, system features and application details.","PeriodicalId":400728,"journal":{"name":"Proceedings the Ninth International Symposium on High-Performance Distributed Computing","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115066595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
RAID-x: a new distributed disk array for I/O-centric cluster computing RAID-x:一种新型分布式磁盘阵列,适用于以I/ o为中心的集群计算
K. Hwang, Hai Jin, Roy S. C. Ho
A new RAID-x (redundant array of inexpensive disks at level x) architecture is presented for distributed I/O processing on a serverless cluster of computers. The RAID-x architecture is based on a new concept of orthogonal striping and mirroring (OSM) across all distributed disks in the cluster. The primary advantages of this OSM approach lie in: (1) a significant improvement in parallel I/O bandwidth; (2) hiding disk mirroring overhead in the background; and (3) greatly enhanced scalability and reliability in cluster computing applications. All claimed advantages are substantiated with benchmark performance results on the Trojans cluster built at USC in 1999. The authors discuss the issues of scalable I/O performance, enhanced system reliability, and striped checkpointing on distributed RAID-x in a serverless cluster environment.
提出了一种新的RAID-x (x级廉价磁盘冗余阵列)体系结构,用于在无服务器计算机集群上进行分布式I/O处理。RAID-x架构基于跨集群中所有分布式磁盘的正交条带和镜像(OSM)的新概念。这种OSM方法的主要优点在于:(1)显著提高并行I/O带宽;(2)在后台隐藏磁盘镜像开销;(3)大大提高了集群计算应用的可扩展性和可靠性。所有声称的优势都与1999年在USC构建的木马集群上的基准性能结果相证实。作者讨论了在无服务器集群环境中分布式RAID-x的可伸缩I/O性能、增强的系统可靠性和条纹检查点等问题。
{"title":"RAID-x: a new distributed disk array for I/O-centric cluster computing","authors":"K. Hwang, Hai Jin, Roy S. C. Ho","doi":"10.1109/HPDC.2000.868660","DOIUrl":"https://doi.org/10.1109/HPDC.2000.868660","url":null,"abstract":"A new RAID-x (redundant array of inexpensive disks at level x) architecture is presented for distributed I/O processing on a serverless cluster of computers. The RAID-x architecture is based on a new concept of orthogonal striping and mirroring (OSM) across all distributed disks in the cluster. The primary advantages of this OSM approach lie in: (1) a significant improvement in parallel I/O bandwidth; (2) hiding disk mirroring overhead in the background; and (3) greatly enhanced scalability and reliability in cluster computing applications. All claimed advantages are substantiated with benchmark performance results on the Trojans cluster built at USC in 1999. The authors discuss the issues of scalable I/O performance, enhanced system reliability, and striped checkpointing on distributed RAID-x in a serverless cluster environment.","PeriodicalId":400728,"journal":{"name":"Proceedings the Ninth International Symposium on High-Performance Distributed Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128999616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 68
Automatic configuration and run-time adaptation of distributed applications 分布式应用程序的自动配置和运行时适应
Fangzhe Chang, V. Karamcheti
Increased platform heterogeneity and varying resource availability in distributed systems motivates the design of resource-aware applications, which ensure a desired performance level by continuously adapting their behavior to changing resource characteristics. In this paper, we describe an application-independent adaptation framework that simplifies the design of resource-aware applications. This framework eliminates the need for adaptation decisions to be explicitly programmed into the application by relying on two novel components: (1) a tunability interface, which exposes adaptation choices in the form of alternate application configurations while encapsulating core application functionality, and (2) a virtual execution environment, which emulates application execution under diverse resource availability enabling off-line collection of information about the resulting behavior. Together, these components permit automatic run-time decisions on when to adapt by continuously monitoring resource conditions and application progress, and how to adapt by dynamically choosing the application configuration that is most appropriate for the prescribed user preference. We evaluate the framework using an interactive distributed image visualization application. The framework permits automatic adaptation to changes in CPU load and network bandwidth by choosing a different compression algorithm or by controlling the image transmission sequence so as to satisfy user preferences of visualization quality and timeliness.
分布式系统中不断增加的平台异构性和不断变化的资源可用性激发了资源感知应用程序的设计,这些应用程序通过不断调整其行为以适应不断变化的资源特征来确保所需的性能水平。在本文中,我们描述了一个独立于应用程序的自适应框架,它简化了资源感知应用程序的设计。该框架通过依赖于两个新的组件,消除了将适应决策显式地编程到应用程序中的需要:(1)可调接口,它在封装核心应用程序功能的同时,以替代应用程序配置的形式公开适应选择;(2)虚拟执行环境,它模拟不同资源可用性下的应用程序执行,支持有关结果行为的离线信息收集。总之,这些组件允许通过持续监视资源条件和应用程序进度来自动决定何时进行调整,以及如何通过动态选择最适合指定用户首选项的应用程序配置来进行调整。我们使用交互式分布式图像可视化应用程序来评估该框架。该框架可以通过选择不同的压缩算法或控制图像传输顺序来自动适应CPU负载和网络带宽的变化,从而满足用户对可视化质量和时效性的偏好。
{"title":"Automatic configuration and run-time adaptation of distributed applications","authors":"Fangzhe Chang, V. Karamcheti","doi":"10.1109/HPDC.2000.868630","DOIUrl":"https://doi.org/10.1109/HPDC.2000.868630","url":null,"abstract":"Increased platform heterogeneity and varying resource availability in distributed systems motivates the design of resource-aware applications, which ensure a desired performance level by continuously adapting their behavior to changing resource characteristics. In this paper, we describe an application-independent adaptation framework that simplifies the design of resource-aware applications. This framework eliminates the need for adaptation decisions to be explicitly programmed into the application by relying on two novel components: (1) a tunability interface, which exposes adaptation choices in the form of alternate application configurations while encapsulating core application functionality, and (2) a virtual execution environment, which emulates application execution under diverse resource availability enabling off-line collection of information about the resulting behavior. Together, these components permit automatic run-time decisions on when to adapt by continuously monitoring resource conditions and application progress, and how to adapt by dynamically choosing the application configuration that is most appropriate for the prescribed user preference. We evaluate the framework using an interactive distributed image visualization application. The framework permits automatic adaptation to changes in CPU load and network bandwidth by choosing a different compression algorithm or by controlling the image transmission sequence so as to satisfy user preferences of visualization quality and timeliness.","PeriodicalId":400728,"journal":{"name":"Proceedings the Ninth International Symposium on High-Performance Distributed Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115796947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 76
2K: a distributed operating system for dynamic heterogeneous environments 2K:用于动态异构环境的分布式操作系统
Fabio Kon, R. Campbell, M. D. Mickunas, K. Nahrstedt, Francisco J. Ballesteros
The first decades of the new millennium will witness an explosive growth in the number and diversity of networked devices and portals. We foresee high degrees of mobility, heterogeneity, and interactions among computing devices connected to global networks. While previous research in distributed operating systems solved many problems related to resource management, they seldom addressed the problems of heterogeneity and dynamic adaptability. On the other hand, middleware solutions, like CORBA and Java/Jini, solve part of the heterogeneity problem by permitting seamless communication among different platforms. But, they do not address dynamic resource management and adaptability for applications requiring high-performance distributed computing. This paper presents 2K, an integrated operating system architecture that addresses the problems of resource management in heterogeneous networks, dynamic adaptability and configuration of component-based distributed applications.
新千年的头几十年将见证网络设备和门户的数量和多样性的爆炸式增长。我们预见到连接到全球网络的计算设备之间的高度移动性、异质性和交互性。以往的分布式操作系统研究解决了许多与资源管理相关的问题,但很少解决异构性和动态适应性问题。另一方面,中间件解决方案,如CORBA和Java/Jini,通过允许不同平台之间的无缝通信解决了部分异构性问题。但是,它们不能解决需要高性能分布式计算的应用程序的动态资源管理和适应性问题。本文提出了一种集成的操作系统体系结构2K,它解决了异构网络中的资源管理、基于组件的分布式应用程序的动态适应性和配置问题。
{"title":"2K: a distributed operating system for dynamic heterogeneous environments","authors":"Fabio Kon, R. Campbell, M. D. Mickunas, K. Nahrstedt, Francisco J. Ballesteros","doi":"10.1109/HPDC.2000.868651","DOIUrl":"https://doi.org/10.1109/HPDC.2000.868651","url":null,"abstract":"The first decades of the new millennium will witness an explosive growth in the number and diversity of networked devices and portals. We foresee high degrees of mobility, heterogeneity, and interactions among computing devices connected to global networks. While previous research in distributed operating systems solved many problems related to resource management, they seldom addressed the problems of heterogeneity and dynamic adaptability. On the other hand, middleware solutions, like CORBA and Java/Jini, solve part of the heterogeneity problem by permitting seamless communication among different platforms. But, they do not address dynamic resource management and adaptability for applications requiring high-performance distributed computing. This paper presents 2K, an integrated operating system architecture that addresses the problems of resource management in heterogeneous networks, dynamic adaptability and configuration of component-based distributed applications.","PeriodicalId":400728,"journal":{"name":"Proceedings the Ninth International Symposium on High-Performance Distributed Computing","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115001757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 136
Interfacing wide-area network computing and cluster management software: Condor, DQS and PBS via PUNCH 通过PUNCH接口广域网计算和集群管理软件:Condor, DQS和PBS
S. Adabala, N. Kapadia, J. Fortes
This paper outlines the issues that must be addressed in order to allow cluster management systems such as Condor, DQS (Distributed Queueing Service) and PBS (Portable Batch System) to be transparently used via a wide-area network computing system such as PUNCH (Purdue University Network Computing Hubs).
本文概述了必须解决的问题,以便允许集群管理系统(如Condor, DQS(分布式队列服务)和PBS(便携式批处理系统)通过诸如PUNCH(普渡大学网络计算中心)之类的广域网计算系统透明地使用。
{"title":"Interfacing wide-area network computing and cluster management software: Condor, DQS and PBS via PUNCH","authors":"S. Adabala, N. Kapadia, J. Fortes","doi":"10.1109/HPDC.2000.868670","DOIUrl":"https://doi.org/10.1109/HPDC.2000.868670","url":null,"abstract":"This paper outlines the issues that must be addressed in order to allow cluster management systems such as Condor, DQS (Distributed Queueing Service) and PBS (Portable Batch System) to be transparently used via a wide-area network computing system such as PUNCH (Purdue University Network Computing Hubs).","PeriodicalId":400728,"journal":{"name":"Proceedings the Ninth International Symposium on High-Performance Distributed Computing","volume":"60 26","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120816548","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Probe - a distributed storage testbed Probe——分布式存储测试平台
R. D. Burris, M. Gleicher, H. Holmes, D. Million, S. R. White
As computers become more capable, researchers of all types are finding it necessary to store massive quantities of data generated by simulations or experiments and to retrieve them at high rate for analysis or visualization. Strong needs have arisen for storage systems tuned for particular needs; significant improvements in storage speed and access control; optimized wide area network bulk transfers; utilization of new media and new types of storage devices; and development, testing, and use of user-written storage applications. The Oak Ridge National Laboratory (ORNL) and the National Energy Research Scientific Computing Center (NERSC) have formed a wide-area distributed testbed, entitled "Probe"-, to support challenging storage-related studies.
随着计算机的功能越来越强大,所有类型的研究人员都发现有必要存储由模拟或实验产生的大量数据,并以高速率检索它们以进行分析或可视化。对于针对特定需求进行调整的存储系统,出现了强烈的需求;存储速度和访问控制方面的显著改进;优化广域网批量传输;新媒体和新型存储设备的使用;以及开发、测试和使用用户编写的存储应用程序。橡树岭国家实验室(ORNL)和国家能源研究科学计算中心(NERSC)组成了一个名为“探针”的广域分布式测试平台,以支持具有挑战性的存储相关研究。
{"title":"Probe - a distributed storage testbed","authors":"R. D. Burris, M. Gleicher, H. Holmes, D. Million, S. R. White","doi":"10.1109/HPDC.2000.868664","DOIUrl":"https://doi.org/10.1109/HPDC.2000.868664","url":null,"abstract":"As computers become more capable, researchers of all types are finding it necessary to store massive quantities of data generated by simulations or experiments and to retrieve them at high rate for analysis or visualization. Strong needs have arisen for storage systems tuned for particular needs; significant improvements in storage speed and access control; optimized wide area network bulk transfers; utilization of new media and new types of storage devices; and development, testing, and use of user-written storage applications. The Oak Ridge National Laboratory (ORNL) and the National Energy Research Scientific Computing Center (NERSC) have formed a wide-area distributed testbed, entitled \"Probe\"-, to support challenging storage-related studies.","PeriodicalId":400728,"journal":{"name":"Proceedings the Ninth International Symposium on High-Performance Distributed Computing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114260610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speculative defragmentation - a technique to improve the communication software efficiency for Gigabit Ethernet 推测碎片整理-一种提高千兆以太网通信软件效率的技术
C. Kurmann, Michael Müller, F. Rauch, T. Stricker
Cluster platforms offer good computational performance, but they still cannot utilize the potential of Gbit/s communication technology. While the speed of the Ethernet has grown to 1 Gbit/s, the functionality and the architectural support in the network interfaces has remained the same for more than a decade, so that the memory system becomes a limiting factor. To sustain the raw network speed in applications, a "zero-copy" network interface architecture would be required, but, for all widely used stacks, a last copy is required for the (de)fragmentation of the transferred network packets, since Ethernet packets are smaller than a page size. Correctly defragmenting packets of various communication protocols in hardware is an extremely complex task. We therefore consider a speculative defragmentation technique that can eliminate the last defragmenting copy operation in zero-copy TCP/IP stacks on existing hardware. The payload of fragmented packets is separated from the headers and stored in a memory page that can be mapped directly to its final destination in user memory. To evaluate our ideas, we integrated a network interface driver with speculative defragmentation into an existing protocol stack and added well-known page remapping and fast buffer strategies. Measurements indicate that we can improve the performance for a Gigabit Ethernet over a standard Linux 2.2 TCP/IP stack by a factor of 1.5-2 for uninterrupted burst transfers. Furthermore, our study demonstrates good speculation success rates for a database and a scientific application code on a cluster of PCs.
集群平台提供了良好的计算性能,但仍不能充分利用Gbit/s通信技术的潜力。虽然以太网的速度已经增长到1 Gbit/s,但网络接口的功能和体系结构支持十多年来一直保持不变,因此内存系统成为一个限制因素。为了维持应用程序中的原始网络速度,需要“零拷贝”网络接口体系结构,但是,对于所有广泛使用的堆栈,需要最后一份拷贝来传输网络数据包(去)碎片,因为以太网数据包小于页面大小。正确地整理硬件中各种通信协议的数据包是一项极其复杂的任务。因此,我们考虑了一种推测性的碎片整理技术,它可以消除现有硬件上零拷贝TCP/IP堆栈中的最后一个碎片整理复制操作。碎片包的有效负载与报头分离并存储在一个内存页中,该内存页可以直接映射到用户内存中的最终目的地。为了评估我们的想法,我们将具有推测碎片整理功能的网络接口驱动程序集成到现有的协议堆栈中,并添加了众所周知的页面重映射和快速缓冲策略。测量表明,我们可以在标准Linux 2.2 TCP/IP堆栈上将千兆以太网的性能提高1.5-2倍,以实现不间断的突发传输。此外,我们的研究表明,在pc集群上,数据库和科学应用程序代码的投机成功率很高。
{"title":"Speculative defragmentation - a technique to improve the communication software efficiency for Gigabit Ethernet","authors":"C. Kurmann, Michael Müller, F. Rauch, T. Stricker","doi":"10.1109/HPDC.2000.868643","DOIUrl":"https://doi.org/10.1109/HPDC.2000.868643","url":null,"abstract":"Cluster platforms offer good computational performance, but they still cannot utilize the potential of Gbit/s communication technology. While the speed of the Ethernet has grown to 1 Gbit/s, the functionality and the architectural support in the network interfaces has remained the same for more than a decade, so that the memory system becomes a limiting factor. To sustain the raw network speed in applications, a \"zero-copy\" network interface architecture would be required, but, for all widely used stacks, a last copy is required for the (de)fragmentation of the transferred network packets, since Ethernet packets are smaller than a page size. Correctly defragmenting packets of various communication protocols in hardware is an extremely complex task. We therefore consider a speculative defragmentation technique that can eliminate the last defragmenting copy operation in zero-copy TCP/IP stacks on existing hardware. The payload of fragmented packets is separated from the headers and stored in a memory page that can be mapped directly to its final destination in user memory. To evaluate our ideas, we integrated a network interface driver with speculative defragmentation into an existing protocol stack and added well-known page remapping and fast buffer strategies. Measurements indicate that we can improve the performance for a Gigabit Ethernet over a standard Linux 2.2 TCP/IP stack by a factor of 1.5-2 for uninterrupted burst transfers. Furthermore, our study demonstrates good speculation success rates for a database and a scientific application code on a cluster of PCs.","PeriodicalId":400728,"journal":{"name":"Proceedings the Ninth International Symposium on High-Performance Distributed Computing","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132565013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Distributed processor allocation in large PC clusters 大型PC集群中的分布式处理器分配
Hans-Ulrich Heiß, C. Rose, P. Navaux
Current processor allocation techniques for highly parallel systems are based on centralized front-end based algorithms. As a result, the applied strategies are restricted to static allocation, low parallelism and weak fault tolerance. To lift these restrictions, we are investigating a distributed approach to the processor allocation problem in large distributed memory machines. A contiguous and a noncontiguous version of a distributed dynamic processor allocation strategy are proposed and studied. Simulations compare the performance of the proposed strategies with that of well-known centralized algorithms. We also present the results of experiments on a Simens hpcline Primergy Server with 96 nodes that show distributed allocation is feasible with current technologies.
当前高度并行系统的处理器分配技术是基于集中式前端算法的。因此,应用的策略局限于静态分配、低并行性和弱容错性。为了解除这些限制,我们正在研究一种分布式方法来解决大型分布式内存机器中的处理器分配问题。提出并研究了一种连续和非连续的分布式动态处理器分配策略。仿真比较了所提策略与知名集中式算法的性能。本文还介绍了在西门子的96个节点的hpcline Primergy服务器上的实验结果,表明分布式分配在现有技术下是可行的。
{"title":"Distributed processor allocation in large PC clusters","authors":"Hans-Ulrich Heiß, C. Rose, P. Navaux","doi":"10.1109/HPDC.2000.868661","DOIUrl":"https://doi.org/10.1109/HPDC.2000.868661","url":null,"abstract":"Current processor allocation techniques for highly parallel systems are based on centralized front-end based algorithms. As a result, the applied strategies are restricted to static allocation, low parallelism and weak fault tolerance. To lift these restrictions, we are investigating a distributed approach to the processor allocation problem in large distributed memory machines. A contiguous and a noncontiguous version of a distributed dynamic processor allocation strategy are proposed and studied. Simulations compare the performance of the proposed strategies with that of well-known centralized algorithms. We also present the results of experiments on a Simens hpcline Primergy Server with 96 nodes that show distributed allocation is feasible with current technologies.","PeriodicalId":400728,"journal":{"name":"Proceedings the Ninth International Symposium on High-Performance Distributed Computing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122390235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
An enabling framework for master-worker applications on the Computational Grid 一个支持计算网格上的主工应用程序的框架
Jean-Pierre Goux, Sanjeev Kulkarni, Jeff T. Linderoth, Michael Yoder
Describes MW (Master-Worker) - a software framework that allows users to quickly and easily parallelize scientific computations using the master-worker paradigm on the Computational Grid. MW provides both a "top-level" interface to application software and a "bottom-level" interface to existing Grid computing toolkits. Both interfaces are briefly described. We conclude with a case study, where the necessary Grid services are provided by the Condor high-throughput computing system, and the MW-enabled application code is used to solve a combinatorial optimization problem of unprecedented complexity.
描述了MW (Master-Worker)——一个软件框架,它允许用户在计算网格上使用Master-Worker范式快速、轻松地并行化科学计算。MW为应用软件提供了“顶层”接口,也为现有的网格计算工具包提供了“底层”接口。简要介绍了这两个接口。最后,我们给出了一个案例研究,其中必要的网格服务由Condor高吞吐量计算系统提供,支持mw的应用程序代码用于解决前所未有的复杂性的组合优化问题。
{"title":"An enabling framework for master-worker applications on the Computational Grid","authors":"Jean-Pierre Goux, Sanjeev Kulkarni, Jeff T. Linderoth, Michael Yoder","doi":"10.1109/HPDC.2000.868633","DOIUrl":"https://doi.org/10.1109/HPDC.2000.868633","url":null,"abstract":"Describes MW (Master-Worker) - a software framework that allows users to quickly and easily parallelize scientific computations using the master-worker paradigm on the Computational Grid. MW provides both a \"top-level\" interface to application software and a \"bottom-level\" interface to existing Grid computing toolkits. Both interfaces are briefly described. We conclude with a case study, where the necessary Grid services are provided by the Condor high-throughput computing system, and the MW-enabled application code is used to solve a combinatorial optimization problem of unprecedented complexity.","PeriodicalId":400728,"journal":{"name":"Proceedings the Ninth International Symposium on High-Performance Distributed Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127775981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 265
High performance communication using a commodity network for cluster systems 集群系统使用商品网络进行高性能通信
S. Sumimoto, H. Tezuka, A. Hori, H. Harada, Toshiyuki Takahashi, Y. Ishikawa
Proposes a scheme to realize a high-performance communication facility using a commodity network. This scheme does not require any special hardware or hardware-specific device drivers in order to adapt to many kinds of network interface cards (NICs). In this scheme, a reliable lightweight network protocol is handled directly on a data link layer called by a network device driver. An interrupt reaping technique is proposed to eliminate the hardware interrupt overhead when an application waits for a message. PM/Ethernet, an instance of the scheme, is implemented on Linux with minimal modification to the Linux kernel, and existing network device drivers are used without any modification. Using Pentium III 500-MHz PCs on Packet Engine's G-NIC II Gigabit Ethernet NIC, it achieves 77.5 MB/s bandwidth and 37.6 /spl mu/s round-trip time latency compared to that of TCP/IP, which achieves 46.7 MB/s bandwidth and 89.6 /spl mu/s round-trip time latency. The NAS parallel benchmark IS results show that MPI on PM/Ethernet achieves 75% better performance than MPI on TCP/IP and is 7.8% slower than that of MPI on Myrinet PM.
提出了一种利用商品网络实现高性能通信设备的方案。该方案不需要任何特殊的硬件或特定于硬件的设备驱动程序,以适应多种网络接口卡(nic)。在这个方案中,一个可靠的轻量级网络协议直接在由网络设备驱动程序调用的数据链路层上处理。提出了一种中断收获技术,以消除应用程序等待消息时的硬件中断开销。PM/Ethernet是该方案的一个实例,在Linux上实现,对Linux内核进行了最小的修改,并且使用现有的网络设备驱动程序而不做任何修改。在Packet Engine的G-NIC II千兆以太网网卡上使用奔腾III 500 mhz pc机,与TCP/IP相比,实现了77.5 MB/s的带宽和37.6 /spl mu/s的往返时间延迟,TCP/IP实现了46.7 MB/s的带宽和89.6 /spl mu/s的往返时间延迟。NAS并行基准IS测试结果表明,MPI在PM/Ethernet上的性能比在TCP/IP上的性能高75%,比在Myrinet PM上的性能低7.8%。
{"title":"High performance communication using a commodity network for cluster systems","authors":"S. Sumimoto, H. Tezuka, A. Hori, H. Harada, Toshiyuki Takahashi, Y. Ishikawa","doi":"10.1109/HPDC.2000.868644","DOIUrl":"https://doi.org/10.1109/HPDC.2000.868644","url":null,"abstract":"Proposes a scheme to realize a high-performance communication facility using a commodity network. This scheme does not require any special hardware or hardware-specific device drivers in order to adapt to many kinds of network interface cards (NICs). In this scheme, a reliable lightweight network protocol is handled directly on a data link layer called by a network device driver. An interrupt reaping technique is proposed to eliminate the hardware interrupt overhead when an application waits for a message. PM/Ethernet, an instance of the scheme, is implemented on Linux with minimal modification to the Linux kernel, and existing network device drivers are used without any modification. Using Pentium III 500-MHz PCs on Packet Engine's G-NIC II Gigabit Ethernet NIC, it achieves 77.5 MB/s bandwidth and 37.6 /spl mu/s round-trip time latency compared to that of TCP/IP, which achieves 46.7 MB/s bandwidth and 89.6 /spl mu/s round-trip time latency. The NAS parallel benchmark IS results show that MPI on PM/Ethernet achieves 75% better performance than MPI on TCP/IP and is 7.8% slower than that of MPI on Myrinet PM.","PeriodicalId":400728,"journal":{"name":"Proceedings the Ninth International Symposium on High-Performance Distributed Computing","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128092352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
期刊
Proceedings the Ninth International Symposium on High-Performance Distributed Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1