首页 > 最新文献

High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on最新文献

英文 中文
An environment for enabling interactive grids 支持交互式网格的环境
V. Talwar, Sujoy Basu, Raj Kumar
Traditional use of grid computing allows a user to submit batch jobs in a grid environment. We believe, next generation grids will extend the application domain to include interactive graphical sessions. We term such grids interactive grids. In this paper, we describe some of the challenges involved in building interactive grids. These include fine grain access control, QoS guarantees, and dynamic account management. In order to architect interactive grids, we propose and describe I-GENV, an environment for enabling interactive grids. I-GENV consists of GISH-'Grid Interactive Shell', Controlled Desktop, SAC-'Session Admission Control' module, GMMA-'Grid Monitoring and Management Agents', System Policies, and Dynamic Account Manager. We also present our testbed implementation of I-GENV using and extending Globus Toolkit 2.0 for the Grid middleware infrastructure, and VNC as the remote display technology.
网格计算的传统用法允许用户在网格环境中提交批处理作业。我们相信,下一代网格将扩展应用领域,包括交互式图形会话。我们称这种网格为交互网格。在本文中,我们描述了构建交互式网格所涉及的一些挑战。其中包括细粒度访问控制、QoS保证和动态帐户管理。为了构建交互式网格,我们提出并描述了I-GENV,一个实现交互式网格的环境。I-GENV由GISH (Grid Interactive Shell)、受控桌面、SAC (Session Admission Control)模块、GMMA (Grid Monitoring and Management Agents)、系统策略和动态客户管理器组成。我们还介绍了I-GENV的测试平台实现,它使用并扩展了用于网格中间件基础设施的Globus Toolkit 2.0,并使用VNC作为远程显示技术。
{"title":"An environment for enabling interactive grids","authors":"V. Talwar, Sujoy Basu, Raj Kumar","doi":"10.1109/HPDC.2003.1210028","DOIUrl":"https://doi.org/10.1109/HPDC.2003.1210028","url":null,"abstract":"Traditional use of grid computing allows a user to submit batch jobs in a grid environment. We believe, next generation grids will extend the application domain to include interactive graphical sessions. We term such grids interactive grids. In this paper, we describe some of the challenges involved in building interactive grids. These include fine grain access control, QoS guarantees, and dynamic account management. In order to architect interactive grids, we propose and describe I-GENV, an environment for enabling interactive grids. I-GENV consists of GISH-'Grid Interactive Shell', Controlled Desktop, SAC-'Session Admission Control' module, GMMA-'Grid Monitoring and Management Agents', System Policies, and Dynamic Account Manager. We also present our testbed implementation of I-GENV using and extending Globus Toolkit 2.0 for the Grid middleware infrastructure, and VNC as the remote display technology.","PeriodicalId":430378,"journal":{"name":"High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127999535","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Resource-aware stream management with the customizable dproc distributed monitoring mechanisms 具有可定制dproc分布式监控机制的资源感知流管理
S. Agarwala, C. Poellabauer, J. Kong, K. Schwan, M. Wolf
Monitoring the resources of distributed systems is essential to the successful deployment and execution of grid applications, particularly when such applications have well-defined QoS requirements. The dproc system-level monitoring mechanisms implemented for standard Linux kernels have several key components. First, utilizing the familiar /proc filesystem, dproc extends this interface with resource information collected from both local and remote hosts. Second, to predictably capture and distribute monitoring information, dproc uses a kernel-level group communication facility, termed KECho, which is based on events and event channels. Third and the focus of this paper is dproc's run-time customizability for resource monitoring, which includes the generation and deployment of monitoring functionality within remote operating system kernels. Using dproc, we show that: (a) data streams can be customized according to a client's resource availabilities (dynamic stream management); (b) by dynamically varying distributed monitoring (dynamic filtering of monitoring information), appropriate balance can be maintained between monitoring overheads and application quality; and (c) by performing monitoring at kernel-level, the information captured enables decision making that takes into account the multiple resources used by applications.
监视分布式系统的资源对于网格应用程序的成功部署和执行至关重要,特别是当这些应用程序具有定义良好的QoS需求时。为标准Linux内核实现的dproc系统级监视机制有几个关键组件。首先,dproc利用我们熟悉的/proc文件系统,用从本地和远程主机收集的资源信息扩展这个接口。其次,为了可预测地捕获和分发监视信息,dproc使用称为KECho的内核级组通信工具,该工具基于事件和事件通道。第三,也是本文的重点是dproc用于资源监控的运行时可定制性,其中包括远程操作系统内核中监控功能的生成和部署。使用dproc,我们表明:(a)数据流可以根据客户端的资源可用性定制(动态流管理);(b)透过动态变化的分布式监控(动态过滤监控信息),可以在监控开销和应用质量之间保持适当的平衡;(c)通过在内核级执行监视,捕获的信息使决策能够考虑到应用程序使用的多种资源。
{"title":"Resource-aware stream management with the customizable dproc distributed monitoring mechanisms","authors":"S. Agarwala, C. Poellabauer, J. Kong, K. Schwan, M. Wolf","doi":"10.1109/HPDC.2003.1210034","DOIUrl":"https://doi.org/10.1109/HPDC.2003.1210034","url":null,"abstract":"Monitoring the resources of distributed systems is essential to the successful deployment and execution of grid applications, particularly when such applications have well-defined QoS requirements. The dproc system-level monitoring mechanisms implemented for standard Linux kernels have several key components. First, utilizing the familiar /proc filesystem, dproc extends this interface with resource information collected from both local and remote hosts. Second, to predictably capture and distribute monitoring information, dproc uses a kernel-level group communication facility, termed KECho, which is based on events and event channels. Third and the focus of this paper is dproc's run-time customizability for resource monitoring, which includes the generation and deployment of monitoring functionality within remote operating system kernels. Using dproc, we show that: (a) data streams can be customized according to a client's resource availabilities (dynamic stream management); (b) by dynamically varying distributed monitoring (dynamic filtering of monitoring information), appropriate balance can be maintained between monitoring overheads and application quality; and (c) by performing monitoring at kernel-level, the information captured enables decision making that takes into account the multiple resources used by applications.","PeriodicalId":430378,"journal":{"name":"High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133656007","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Trace-based simulations of processor co-allocation policies in multiclusters 基于跟踪的多集群处理器协同分配策略模拟
A. Bucur, D. Epema
In systems consisting of multiple clusters of processors which employ space sharing for scheduling jobs, such as our Distributed ASCI (Advanced School for Computing Imaging) Supercomputer (DAS), co-allocation, i.e., the simultaneous allocation of processors to single jobs in multiple clusters, may be required. In this paper we study the performance of several scheduling policies for co-allocating unordered requests in multiclusters with a workload derived from the DAS. We find that beside the policy, limiting the total job size significantly improves the performance, and that for a slowdown of jobs due to global communication bounded by 1.25, co-allocation is a viable choice.
在由多个处理器集群组成的系统中,使用空间共享来调度作业,例如我们的分布式ASCI(高级计算成像学院)超级计算机(DAS),可能需要共同分配,即同时将处理器分配给多个集群中的单个作业。在本文中,我们研究了几种调度策略在多集群中协同分配无序请求时的性能。我们发现,除了该策略外,限制总作业大小显著提高了性能,并且由于全局通信限制在1.25以内,作业速度减慢,共同分配是一个可行的选择。
{"title":"Trace-based simulations of processor co-allocation policies in multiclusters","authors":"A. Bucur, D. Epema","doi":"10.1109/HPDC.2003.1210017","DOIUrl":"https://doi.org/10.1109/HPDC.2003.1210017","url":null,"abstract":"In systems consisting of multiple clusters of processors which employ space sharing for scheduling jobs, such as our Distributed ASCI (Advanced School for Computing Imaging) Supercomputer (DAS), co-allocation, i.e., the simultaneous allocation of processors to single jobs in multiple clusters, may be required. In this paper we study the performance of several scheduling policies for co-allocating unordered requests in multiclusters with a workload derived from the DAS. We find that beside the policy, limiting the total job size significantly improves the performance, and that for a slowdown of jobs due to global communication bounded by 1.25, co-allocation is a viable choice.","PeriodicalId":430378,"journal":{"name":"High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134613346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 35
RUMR: robust scheduling for divisible workloads RUMR:可分割工作负载的健壮调度
Yang Yang, H. Casanova
Divisible workload applications arise in many fields of science and engineering. They can be parallelized in master-worker fashion and relevant scheduling strategies have been proposed to reduce application markspan. Our goal is to developed a practical divisible workload scheduling strategy. This requires that previous work be revisited as several usual assumptions about the computing platform do not hold in practice. We have partially addressed this concern in a previous paper via an algorithm that achieves high performance with realistic resource latency models. In this paper we extend our approach to account for performance prediction errors, which are expected for most real-world performance and applications. In essence, we combine ideas from multiround divisible workload scheduling, for performance, and from factoring-based scheduling, for robustness. We present simulation results to quantify the benefits of our approach compared to our original algorithm and to other previously proposed algorithms.
可分负载的应用出现在许多科学和工程领域。它们可以以主工方式并行化,并提出了相应的调度策略来减小应用程序的标记跨度。我们的目标是开发一种实用的可分割工作负载调度策略。这需要重新审视以前的工作,因为关于计算平台的几个通常假设在实践中并不成立。在之前的一篇论文中,我们已经通过一种算法部分解决了这个问题,该算法通过现实的资源延迟模型实现了高性能。在本文中,我们扩展了我们的方法来解释大多数实际性能和应用程序所期望的性能预测误差。从本质上讲,我们结合了多轮可分工作负载调度的思想,以提高性能,以及基于因式调度的思想,以提高鲁棒性。我们给出了仿真结果来量化我们的方法与我们的原始算法和其他先前提出的算法相比的好处。
{"title":"RUMR: robust scheduling for divisible workloads","authors":"Yang Yang, H. Casanova","doi":"10.1109/HPDC.2003.1210021","DOIUrl":"https://doi.org/10.1109/HPDC.2003.1210021","url":null,"abstract":"Divisible workload applications arise in many fields of science and engineering. They can be parallelized in master-worker fashion and relevant scheduling strategies have been proposed to reduce application markspan. Our goal is to developed a practical divisible workload scheduling strategy. This requires that previous work be revisited as several usual assumptions about the computing platform do not hold in practice. We have partially addressed this concern in a previous paper via an algorithm that achieves high performance with realistic resource latency models. In this paper we extend our approach to account for performance prediction errors, which are expected for most real-world performance and applications. In essence, we combine ideas from multiround divisible workload scheduling, for performance, and from factoring-based scheduling, for robustness. We present simulation results to quantify the benefits of our approach compared to our original algorithm and to other previously proposed algorithms.","PeriodicalId":430378,"journal":{"name":"High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125306851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 46
Optimizing GridFTP through dynamic right-sizing 通过动态调整大小来优化GridFTP
S. Thulasidasan, Wu-chun Feng, M. Gardner
In this paper, we describe the integration of dynamic right-sizing - an automatic and scalable buffer management technique for enhancing TCP (transport control protocol) performance - into GridFTP, a subsystem of the Globus Toolkit for managing bulk data transfers across computational Grids. Such Grids are often characterized by networks with large bandwidth-delay products. Unfortunately, many of today's Grid applications use only a small fraction of available bandwidth because the default buffer sizes in TCP are tuned for yesterday's WAN (wide access network) speeds. Buffer sizes can be manually tuned to allow TCP flow control to adapt to high-speed WAN environments, but this is a tedious process. Although recent work has shown how to automatically tune system buffers during connection set-up, these values may not be appropriate for the connection's lifetime due to varying network delay and throughput. We show how using the technique of dynamic right-sizing (DRS) in GridFTP helps us optimize memory usage while maintaining high throughput over the lifetime of the connection. We also show how DRS enhances important GridFTP features such as striped and third-party data transfers in a scalable way. The technique is implemented entirely in user space so that end users do not have to modify the kernel.
在本文中,我们描述了将动态调整大小(一种用于增强TCP(传输控制协议)性能的自动可扩展缓冲区管理技术)集成到GridFTP中的方法,GridFTP是Globus Toolkit的一个子系统,用于管理跨计算网格的批量数据传输。这种网格通常以具有大带宽延迟产品的网络为特征。不幸的是,今天的许多网格应用程序只使用了可用带宽的一小部分,因为TCP中的默认缓冲区大小是针对昨天的广域网(WAN)速度进行调整的。可以手动调整缓冲区大小,以允许TCP流控制适应高速WAN环境,但这是一个繁琐的过程。尽管最近的工作展示了如何在连接设置期间自动调优系统缓冲区,但由于网络延迟和吞吐量的变化,这些值可能不适合连接的生命周期。我们将展示在GridFTP中使用动态调整大小(DRS)技术如何帮助我们优化内存使用,同时在连接的整个生命周期内保持高吞吐量。我们还展示了DRS如何以可扩展的方式增强GridFTP的重要特性,如条纹和第三方数据传输。该技术完全在用户空间中实现,因此最终用户不必修改内核。
{"title":"Optimizing GridFTP through dynamic right-sizing","authors":"S. Thulasidasan, Wu-chun Feng, M. Gardner","doi":"10.1109/HPDC.2003.1210012","DOIUrl":"https://doi.org/10.1109/HPDC.2003.1210012","url":null,"abstract":"In this paper, we describe the integration of dynamic right-sizing - an automatic and scalable buffer management technique for enhancing TCP (transport control protocol) performance - into GridFTP, a subsystem of the Globus Toolkit for managing bulk data transfers across computational Grids. Such Grids are often characterized by networks with large bandwidth-delay products. Unfortunately, many of today's Grid applications use only a small fraction of available bandwidth because the default buffer sizes in TCP are tuned for yesterday's WAN (wide access network) speeds. Buffer sizes can be manually tuned to allow TCP flow control to adapt to high-speed WAN environments, but this is a tedious process. Although recent work has shown how to automatically tune system buffers during connection set-up, these values may not be appropriate for the connection's lifetime due to varying network delay and throughput. We show how using the technique of dynamic right-sizing (DRS) in GridFTP helps us optimize memory usage while maintaining high throughput over the lifetime of the connection. We also show how DRS enhances important GridFTP features such as striped and third-party data transfers in a scalable way. The technique is implemented entirely in user space so that end users do not have to modify the kernel.","PeriodicalId":430378,"journal":{"name":"High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115020452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Adaptive polling of grid resource monitors using a slacker coherence model 基于slacker相干模型的网格资源监视器自适应轮询
R. Sundaresan, Mario Lauria, T. Kurç, S. Parthasarathy, J. Saltz
As data and computational grids grow in size and complexity, the crucial task of identifying, monitoring and utilizing available resources in an efficient manner is becoming increasingly difficult. The design of monitoring systems that are scalable both in the number of sources being monitored and in the number of clients served is a challenging issue. In this paper we investigate the trade-offs of different polling strategies that can be used to monitor resource availability on machines in a distributed environment. We show how adaptive polling protocols can substantially increase scalability with a less than proportional loss of precision, and how these protocols can be personalized for different types of resource usage patterns.
随着数据和计算网格的规模和复杂性的增长,以有效的方式识别、监测和利用可用资源的关键任务变得越来越困难。监视系统的设计在被监视的源数量和所服务的客户机数量上都是可扩展的,这是一个具有挑战性的问题。在本文中,我们研究了可用于监视分布式环境中机器上的资源可用性的不同轮询策略的权衡。我们将展示自适应轮询协议如何在精度损失小于成比例的情况下大幅提高可伸缩性,以及如何针对不同类型的资源使用模式对这些协议进行个性化。
{"title":"Adaptive polling of grid resource monitors using a slacker coherence model","authors":"R. Sundaresan, Mario Lauria, T. Kurç, S. Parthasarathy, J. Saltz","doi":"10.1109/HPDC.2003.1210035","DOIUrl":"https://doi.org/10.1109/HPDC.2003.1210035","url":null,"abstract":"As data and computational grids grow in size and complexity, the crucial task of identifying, monitoring and utilizing available resources in an efficient manner is becoming increasingly difficult. The design of monitoring systems that are scalable both in the number of sources being monitored and in the number of clients served is a challenging issue. In this paper we investigate the trade-offs of different polling strategies that can be used to monitor resource availability on machines in a distributed environment. We show how adaptive polling protocols can substantially increase scalability with a less than proportional loss of precision, and how these protocols can be personalized for different types of resource usage patterns.","PeriodicalId":430378,"journal":{"name":"High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114941761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
The Ethernet approach to grid computing 网格计算的以太网方法
D. Thain, M. Livny
Despite many competitors, Ethernet became the dominant protocol for local area networking due to its simplicity, robustness, and efficiency in wide variety of conditions and technology. Reflecting on the current frailty of much software, grid and otherwise, we propose that the Ethernet approach to resource sharing is an effective and reliable technique for combining coarse-grained software when failures are common and poorly detailed. This approach involves placing several simple but important responsibilities on client software to acquire shared resources conservatively, to back off during periods of failure, and to inform competing clients when resources are in contention. We present a simple scripting language that simplifies and encourages the Ethernet approach, and demonstrate its use in several grid computing scenarios, including job submission, disk allocation, and data replication. We conclude with a discussion of the limitations of this approach, and describe how it is uniquely suited to high-level programming.
尽管有许多竞争对手,以太网由于其简单性、健壮性和在各种条件和技术下的效率而成为局域网的主导协议。考虑到当前许多软件、网格和其他方面的脆弱性,我们提出以太网资源共享方法是一种有效且可靠的技术,可用于在故障常见且不详细的情况下组合粗粒度软件。这种方法包括将几个简单但重要的责任放在客户机软件上,以保守地获取共享资源,在故障期间退出,并在资源争用时通知竞争的客户机。我们提供了一种简单的脚本语言,它简化并鼓励使用以太网方法,并演示了它在几个网格计算场景中的使用,包括作业提交、磁盘分配和数据复制。最后,我们讨论了这种方法的局限性,并描述了它如何独特地适用于高级编程。
{"title":"The Ethernet approach to grid computing","authors":"D. Thain, M. Livny","doi":"10.1109/HPDC.2003.1210024","DOIUrl":"https://doi.org/10.1109/HPDC.2003.1210024","url":null,"abstract":"Despite many competitors, Ethernet became the dominant protocol for local area networking due to its simplicity, robustness, and efficiency in wide variety of conditions and technology. Reflecting on the current frailty of much software, grid and otherwise, we propose that the Ethernet approach to resource sharing is an effective and reliable technique for combining coarse-grained software when failures are common and poorly detailed. This approach involves placing several simple but important responsibilities on client software to acquire shared resources conservatively, to back off during periods of failure, and to inform competing clients when resources are in contention. We present a simple scripting language that simplifies and encourages the Ethernet approach, and demonstrate its use in several grid computing scenarios, including job submission, disk allocation, and data replication. We conclude with a discussion of the limitations of this approach, and describe how it is uniquely suited to high-level programming.","PeriodicalId":430378,"journal":{"name":"High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123926790","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Zero-copy for CORBA - efficient communication for distributed object middleware 面向分布式对象中间件的CORBA高效通信零拷贝
C. Kurmann, T. Stricker
Many large applications require distributed computing for the sake of better performance and software systems that facilitate the development of such applications have attracted a great deal of attention. Modeling the application as distributed objects or components promises the benefits of better abstractions and increased software reuse. Using distributed object middleware (DOM) like CORBA (common object request broker architecture) looks promising, but most often one cannot afford its notorious inefficiency. We address the bandwidth bottleneck by extending highly efficient zero-copy communication architecture from the operating system through the middleware layers all the way to the application. In contrast to previous attempts on improving efficiency in CORBA we preserve the advantages of object oriented abstraction for the software design process and propose an efficient CORBA system that can handle bulk data transfers within the object request broker (ORB). Our prototype uses a clean separation of control-and data transfers within the ORB and for the ORB-to-ORB communication and manages to get rid of all inefficient buffering for certain types while still preserving the standard Internet interORB protocol (IIOP). It achieves the full performance that is only available with a strict zero-copy implementation across all layers between the operating system and the application.
为了获得更好的性能,许多大型应用程序需要分布式计算,促进此类应用程序开发的软件系统引起了人们的极大关注。将应用程序建模为分布式对象或组件可以带来更好的抽象和增加软件重用的好处。使用像CORBA(公共对象请求代理体系结构)这样的分布式对象中间件(DOM)看起来很有希望,但大多数情况下,人们无法承受其臭名昭著的低效率。我们通过从操作系统到中间件层一直到应用程序扩展高效的零复制通信体系结构来解决带宽瓶颈。与之前在提高CORBA效率方面的尝试不同,我们保留了面向对象抽象在软件设计过程中的优势,并提出了一个可以在对象请求代理(ORB)中处理批量数据传输的高效CORBA系统。我们的原型在ORB内部和ORB到ORB通信中使用了控制和数据传输的干净分离,并设法消除了某些类型的所有低效缓冲,同时仍然保留了标准的Internet interORB协议(IIOP)。它实现了只有在操作系统和应用程序之间的所有层中使用严格的零复制实现才能获得的完整性能。
{"title":"Zero-copy for CORBA - efficient communication for distributed object middleware","authors":"C. Kurmann, T. Stricker","doi":"10.1109/HPDC.2003.1210011","DOIUrl":"https://doi.org/10.1109/HPDC.2003.1210011","url":null,"abstract":"Many large applications require distributed computing for the sake of better performance and software systems that facilitate the development of such applications have attracted a great deal of attention. Modeling the application as distributed objects or components promises the benefits of better abstractions and increased software reuse. Using distributed object middleware (DOM) like CORBA (common object request broker architecture) looks promising, but most often one cannot afford its notorious inefficiency. We address the bandwidth bottleneck by extending highly efficient zero-copy communication architecture from the operating system through the middleware layers all the way to the application. In contrast to previous attempts on improving efficiency in CORBA we preserve the advantages of object oriented abstraction for the software design process and propose an efficient CORBA system that can handle bulk data transfers within the object request broker (ORB). Our prototype uses a clean separation of control-and data transfers within the ORB and for the ORB-to-ORB communication and manages to get rid of all inefficient buffering for certain types while still preserving the standard Internet interORB protocol (IIOP). It achieves the full performance that is only available with a strict zero-copy implementation across all layers between the operating system and the application.","PeriodicalId":430378,"journal":{"name":"High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116753718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
XenoSearch: distributed resource discovery in the XenoServer open platform XenoSearch: XenoServer开放平台中的分布式资源发现
David Spence, T. Harris
We describe the XenoSearch system for performing expressive resource discovery searches in a distributed environment. We represent server meta-data, such as their locations and facilities, as points in a multi-dimensional space and then express queries as predicates over these points. Each XenoSearch node holds a portion of this space and the key goal of XenoSearch is to direct queries to those nodes containing the meta-data of matching XenoServers. Communication between these XenoSearch nodes is based on the self-organizing Pastry peer-to-peer routing substrate. Our initial performance evaluation on a wide-area prototype shows that queries are only a factor of 3 to 5 times longer than basic Pastry routing, while supporting multi-dimensional searches of arbitrary shapes.
我们描述了XenoSearch系统,用于在分布式环境中执行富有表现力的资源发现搜索。我们将服务器元数据(例如它们的位置和设施)表示为多维空间中的点,然后将查询表示为这些点上的谓词。每个XenoSearch节点都持有该空间的一部分,XenoSearch的关键目标是将查询指向包含匹配xenoserver的元数据的节点。这些XenoSearch节点之间的通信基于自组织的点对点路由基板。我们在广域原型上的初步性能评估表明,查询只比基本的糕点路由长3到5倍,同时支持任意形状的多维搜索。
{"title":"XenoSearch: distributed resource discovery in the XenoServer open platform","authors":"David Spence, T. Harris","doi":"10.1109/HPDC.2003.1210031","DOIUrl":"https://doi.org/10.1109/HPDC.2003.1210031","url":null,"abstract":"We describe the XenoSearch system for performing expressive resource discovery searches in a distributed environment. We represent server meta-data, such as their locations and facilities, as points in a multi-dimensional space and then express queries as predicates over these points. Each XenoSearch node holds a portion of this space and the key goal of XenoSearch is to direct queries to those nodes containing the meta-data of matching XenoServers. Communication between these XenoSearch nodes is based on the self-organizing Pastry peer-to-peer routing substrate. Our initial performance evaluation on a wide-area prototype shows that queries are only a factor of 3 to 5 times longer than basic Pastry routing, while supporting multi-dimensional searches of arbitrary shapes.","PeriodicalId":430378,"journal":{"name":"High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132548307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 106
Quelling queue storms 平息排队风暴
S. Kleban, S. Clearwater
This paper characterizes "queue storms" in supercomputer systems and discusses methods for quelling them. Queue storms are anomalously large queue lengths dependent upon the job size mix, the queuing system, the machine size, and correlations and dependencies between job submissions. We use synthetic data generated from actual job log data from the ASCI Blue Mountain supercomputer combined with different long-range dependencies. We show the distribution of times from the first storm to occur, which is in a sense the time when the machine becomes obsolete because it represents the time when the machine first fails to provide satisfactory turnaround. To overcome queue storms, more resources are needed even if they appear superfluous most of the time. We present two methods, including a grid-based solution, for reducing these correlations and their resulting effect on the size and frequency of queue storms.
本文描述了超级计算机系统中“队列风暴”的特征,并讨论了抑制队列风暴的方法。队列风暴是异常大的队列长度,这取决于作业大小组合、排队系统、机器大小以及作业提交之间的相关性和依赖性。我们使用来自ASCI蓝山超级计算机的实际作业日志数据生成的合成数据,并结合不同的远程依赖关系。我们展示了从第一场风暴开始的时间分布,从某种意义上说,这是机器过时的时间,因为它代表了机器第一次无法提供令人满意的周转的时间。为了克服队列风暴,需要更多的资源,即使它们在大多数时候看起来是多余的。我们提出了两种方法,包括一种基于网格的解决方案,用于减少这些相关性及其对队列风暴大小和频率的影响。
{"title":"Quelling queue storms","authors":"S. Kleban, S. Clearwater","doi":"10.1109/HPDC.2003.1210026","DOIUrl":"https://doi.org/10.1109/HPDC.2003.1210026","url":null,"abstract":"This paper characterizes \"queue storms\" in supercomputer systems and discusses methods for quelling them. Queue storms are anomalously large queue lengths dependent upon the job size mix, the queuing system, the machine size, and correlations and dependencies between job submissions. We use synthetic data generated from actual job log data from the ASCI Blue Mountain supercomputer combined with different long-range dependencies. We show the distribution of times from the first storm to occur, which is in a sense the time when the machine becomes obsolete because it represents the time when the machine first fails to provide satisfactory turnaround. To overcome queue storms, more resources are needed even if they appear superfluous most of the time. We present two methods, including a grid-based solution, for reducing these correlations and their resulting effect on the size and frequency of queue storms.","PeriodicalId":430378,"journal":{"name":"High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124080122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
期刊
High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1