首页 > 最新文献

2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid最新文献

英文 中文
Optimal Power Management for Server Farm to Support Green Computing
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.89
D. Niyato, Sivadon Chaisiri, Bu-Sung Lee
Green computing is a new paradigm of designing the computer system which considers not only the processing performance but also the energy efficiency. Power management is one of the approaches in green computing to reduce the power consumption in distributed computing system. In this paper, we first propose an optimal power management (OPM) used by a batch scheduler in a server farm. This OPM observes the state of a server farm and makes the decision to switch the operation mode (i.e., active or sleep) of the server to minimize the power consumption while the performance requirements are met. An optimization problem based on constrained Markov decision process (CMDP) is formulated and solved to obtain an optimal decision of OPM. Given that OPM is used in the server farm, then an assignment of users to the server farms by a job broker is considered. This assignment is to ensure that the cost due to power consumption and network transportation is minimized. The performance of the system is extensively evaluated. The result shows that with OPM the job waiting time can be maintained below the maximum threshold while the power consumption is much smaller than that without OPM.
绿色计算是一种既考虑处理性能又考虑能源效率的计算机系统设计新范式。电源管理是绿色计算中降低分布式计算系统功耗的方法之一。在本文中,我们首先提出了一种用于服务器群中的批调度程序的最优电源管理(OPM)。该OPM观察服务器群的状态,并决定切换服务器的操作模式(即活动或休眠),以便在满足性能要求的同时最大限度地减少功耗。提出了一个基于约束马尔可夫决策过程的优化问题,并对其进行了求解,得到了OPM的最优决策。假设在服务器场中使用了OPM,那么将考虑由作业代理将用户分配到服务器场。这个分配是为了确保由于电力消耗和网络传输造成的成本最小化。系统的性能得到了广泛的评估。结果表明,采用OPM时,作业等待时间可以保持在最大阈值以下,而功耗比不采用OPM时要小得多。
{"title":"Optimal Power Management for Server Farm to Support Green Computing","authors":"D. Niyato, Sivadon Chaisiri, Bu-Sung Lee","doi":"10.1109/CCGRID.2009.89","DOIUrl":"https://doi.org/10.1109/CCGRID.2009.89","url":null,"abstract":"Green computing is a new paradigm of designing the computer system which considers not only the processing performance but also the energy efficiency. Power management is one of the approaches in green computing to reduce the power consumption in distributed computing system. In this paper, we first propose an optimal power management (OPM) used by a batch scheduler in a server farm. This OPM observes the state of a server farm and makes the decision to switch the operation mode (i.e., active or sleep) of the server to minimize the power consumption while the performance requirements are met. An optimization problem based on constrained Markov decision process (CMDP) is formulated and solved to obtain an optimal decision of OPM. Given that OPM is used in the server farm, then an assignment of users to the server farms by a job broker is considered. This assignment is to ensure that the cost due to power consumption and network transportation is minimized. The performance of the system is extensively evaluated. The result shows that with OPM the job waiting time can be maintained below the maximum threshold while the power consumption is much smaller than that without OPM.","PeriodicalId":118263,"journal":{"name":"2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132046064","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 75
Combined Fault Tolerance and Scheduling Techniques for Workflow Applications on Computational Grids 计算网格下工作流应用的容错与调度技术
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.59
Yang Zhang, A. Mandal, C. Koelbel, K. Cooper
Complex scientific workflows are now Increasingly executed on computational grids. In addition to the challenges of managing and scheduling these workflows, reliability challenges arise because of the unreliable nature of large-scale grid infrastructure. Fault tolerance mechanisms like over-provisioning and checkpoint-recovery are used in current grid application management systems to address these reliability challenges. In this work, we propose new approaches that combine these fault tolerance techniques with existing workflow scheduling algorithms. We present a study on the effectiveness of the combined approaches by analyzing their impact on the reliability of workflow execution, workflow performance and resource usage under different reliability models, failure prediction accuracies and workflow application types.
复杂的科学工作流程现在越来越多地在计算网格上执行。除了管理和调度这些工作流的挑战之外,由于大规模网格基础设施的不可靠性,可靠性也面临挑战。当前的网格应用程序管理系统中使用了诸如过度供应和检查点恢复之类的容错机制来解决这些可靠性挑战。在这项工作中,我们提出了将这些容错技术与现有工作流调度算法相结合的新方法。通过分析不同可靠性模型、故障预测精度和工作流应用类型对工作流执行可靠性、工作流性能和资源使用的影响,研究了组合方法的有效性。
{"title":"Combined Fault Tolerance and Scheduling Techniques for Workflow Applications on Computational Grids","authors":"Yang Zhang, A. Mandal, C. Koelbel, K. Cooper","doi":"10.1109/CCGRID.2009.59","DOIUrl":"https://doi.org/10.1109/CCGRID.2009.59","url":null,"abstract":"Complex scientific workflows are now Increasingly executed on computational grids. In addition to the challenges of managing and scheduling these workflows, reliability challenges arise because of the unreliable nature of large-scale grid infrastructure. Fault tolerance mechanisms like over-provisioning and checkpoint-recovery are used in current grid application management systems to address these reliability challenges. In this work, we propose new approaches that combine these fault tolerance techniques with existing workflow scheduling algorithms. We present a study on the effectiveness of the combined approaches by analyzing their impact on the reliability of workflow execution, workflow performance and resource usage under different reliability models, failure prediction accuracies and workflow application types.","PeriodicalId":118263,"journal":{"name":"2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","volume":"245 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131201308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 68
Distributed Indexing for Resource Discovery in P2P Networks 面向P2P网络资源发现的分布式索引
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.57
M. Hentschel, Maozhen Li, M. Ponraj, M. Qi
P2P networks facilitate people belonging to a community to share resources of interest. However, discovering resources in a large scale P2P network poses a number of challenges. Although Distributed Hash Table (DHT) structured P2P networks have shown enhanced scalability in routing messages, they only support key based exact matches. This paper presents DIndex, a distributed indexing component that can be used in P2P networks in support of range queries. DIndex introduces the concept of search dimensions for partitioning a search space, and it organizes peer nodes in a three-layered structure. Experimental results show that, for aP2P network with N number of peers, the average number of hops per message is less than log(N).
P2P网络方便了属于一个社区的人们共享感兴趣的资源。然而,在大规模的P2P网络中发现资源带来了许多挑战。尽管分布式哈希表(DHT)结构化的P2P网络在路由消息方面显示出增强的可伸缩性,但它们只支持基于键的精确匹配。本文提出了一种用于P2P网络的分布式索引组件DIndex,用于支持范围查询。索引引入了搜索维度的概念来划分搜索空间,并以三层结构组织对等节点。实验结果表明,对于具有N个对等体的aP2P网络,每条消息的平均跳数小于log(N)。
{"title":"Distributed Indexing for Resource Discovery in P2P Networks","authors":"M. Hentschel, Maozhen Li, M. Ponraj, M. Qi","doi":"10.1109/CCGRID.2009.57","DOIUrl":"https://doi.org/10.1109/CCGRID.2009.57","url":null,"abstract":"P2P networks facilitate people belonging to a community to share resources of interest. However, discovering resources in a large scale P2P network poses a number of challenges. Although Distributed Hash Table (DHT) structured P2P networks have shown enhanced scalability in routing messages, they only support key based exact matches. This paper presents DIndex, a distributed indexing component that can be used in P2P networks in support of range queries. DIndex introduces the concept of search dimensions for partitioning a search space, and it organizes peer nodes in a three-layered structure. Experimental results show that, for aP2P network with N number of peers, the average number of hops per message is less than log(N).","PeriodicalId":118263,"journal":{"name":"2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115172019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
On-Demand Resource Provisioning for BPEL Workflows Using Amazon's Elastic Compute Cloud 使用Amazon的弹性计算云为BPEL工作流提供按需资源配置
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.30
Tim Dörnemann, Ernst Juhnke, Bernd Freisleben
BPEL is the de facto standard for business process modeling in today's enterprises and is a promising candidate for the integration of business and Grid applications. Current BPEL implementations do not provide mechanisms to schedule service calls with respect to the load of the target hosts. In this paper, a solution that automatically schedules workflow steps to underutilized hosts and provides new hosts using Cloud computing infrastructures in peak-load situations is presented. The proposed approach does not require any changes to the BPEL standard.  An implementation based on the ActiveBPEL engine and Amazon's Elastic Compute Cloud is presented.
BPEL是当今企业中业务流程建模的事实上的标准,并且是集成业务和网格应用程序的一个很有前途的候选者。当前的BPEL实现没有提供根据目标主机的负载来调度服务调用的机制。本文提出了一种解决方案,可以自动将工作流步骤调度到未充分利用的主机上,并在峰值负载情况下使用云计算基础设施提供新的主机。建议的方法不需要对BPEL标准进行任何更改。提出了一个基于ActiveBPEL引擎和Amazon的弹性计算云的实现。
{"title":"On-Demand Resource Provisioning for BPEL Workflows Using Amazon's Elastic Compute Cloud","authors":"Tim Dörnemann, Ernst Juhnke, Bernd Freisleben","doi":"10.1109/CCGRID.2009.30","DOIUrl":"https://doi.org/10.1109/CCGRID.2009.30","url":null,"abstract":"BPEL is the de facto standard for business process modeling in today's enterprises and is a promising candidate for the integration of business and Grid applications. Current BPEL implementations do not provide mechanisms to schedule service calls with respect to the load of the target hosts. In this paper, a solution that automatically schedules workflow steps to underutilized hosts and provides new hosts using Cloud computing infrastructures in peak-load situations is presented. The proposed approach does not require any changes to the BPEL standard.  An implementation based on the ActiveBPEL engine and Amazon's Elastic Compute Cloud is presented.","PeriodicalId":118263,"journal":{"name":"2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124004388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 142
Self-Chord: A Bio-inspired Algorithm for Structured P2P Systems 自弦:结构化P2P系统的生物启发算法
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.39
Agostino Forestiero, C. Mastroianni, M. Meo
This paper presents “Self-Chord”, a bio-inspired P2P algorithm that can be profitably adopted to build the information service of distributed systems, in particular Computational Grids and Clouds. Self-Chord inherits the ability of Chord-like structured systems for the construction and maintenance of an overlay of peers, but features enhanced functionalities deriving from the activity of ant-inspired mobile agents, such as autonomy behavior, self-organization and capacity to adapt to a changing environment. Self-Chord features three main benefits with respect to classical P2P structured systems: (i) it is possible to give a semantic meaning to keys, which enables the execution of "class" queries, often issued in Grids and Clouds; (ii) the keys are fairly distributed over the peers, thus improving the balancing of storage responsibilities; (iii) maintenance load is reduced because, as new peers join the ring, the mobile agents will spontaneously reorganize the keys in logarithmic time.
本文提出了一种生物启发的P2P算法“Self-Chord”,可以有效地用于构建分布式系统的信息服务,特别是计算网格和云。Self-Chord继承了类似chord的结构化系统的能力,用于构建和维护对等体的覆盖,但具有增强的功能,这些功能来自于反激励的移动代理的活动,如自治行为、自组织和适应不断变化的环境的能力。Self-Chord与经典的P2P结构化系统相比有三个主要优点:(i)它可以给键一个语义意义,这使得执行“类”查询成为可能,通常在网格和云中发出;(ii)密钥在对等体之间公平分配,从而改善存储责任的平衡;(iii)减少了维护负荷,因为当新的对等体加入环时,移动代理会在对数时间内自发地重新组织密钥。
{"title":"Self-Chord: A Bio-inspired Algorithm for Structured P2P Systems","authors":"Agostino Forestiero, C. Mastroianni, M. Meo","doi":"10.1109/CCGRID.2009.39","DOIUrl":"https://doi.org/10.1109/CCGRID.2009.39","url":null,"abstract":"This paper presents “Self-Chord”, a bio-inspired P2P algorithm that can be profitably adopted to build the information service of distributed systems, in particular Computational Grids and Clouds. Self-Chord inherits the ability of Chord-like structured systems for the construction and maintenance of an overlay of peers, but features enhanced functionalities deriving from the activity of ant-inspired mobile agents, such as autonomy behavior, self-organization and capacity to adapt to a changing environment. Self-Chord features three main benefits with respect to classical P2P structured systems: (i) it is possible to give a semantic meaning to keys, which enables the execution of \"class\" queries, often issued in Grids and Clouds; (ii) the keys are fairly distributed over the peers, thus improving the balancing of storage responsibilities; (iii) maintenance load is reduced because, as new peers join the ring, the mobile agents will spontaneously reorganize the keys in logarithmic time.","PeriodicalId":118263,"journal":{"name":"2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126038196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Handling Persistent States in Process Checkpoint/Restart Mechanisms for HPC Systems 处理HPC系统进程检查点/重启机制中的持久状态
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.29
Pierre Riteau, A. Lèbre, C. Morin
Computer clusters are today the reference architecture for high-performance computing. The large number of nodes in these systems induces a high failure rate. This makes fault tolerance mechanisms, e.g. process checkpoint/restart, a required technology to effectively exploit clusters. Most of the process checkpoint/restart implementations only handle volatile states and do not take into account persistent states of applications, which can lead to incoherent application restarts. In this paper, we introduce an efficient persistent state checkpoint/restoration approach that can be interconnected with a large number of file systems. To avoid the performance issues of a stable support relying on synchronous replication mechanisms, we present a failure resilience scheme optimized for such persistent state checkpointing techniques in a distributed environment. First evaluations of our implementation in the kDFS distributed file system show the negligible performance impact of our proposal.
计算机集群是当今高性能计算的参考体系结构。这些系统中节点数量多,故障率高。这使得容错机制,例如进程检查点/重启,成为有效利用集群的必要技术。大多数进程检查点/重启实现只处理易变状态,而不考虑应用程序的持久状态,这可能导致不一致的应用程序重启。在本文中,我们介绍了一种有效的持久状态检查点/恢复方法,该方法可以与大量文件系统相互连接。为了避免依赖于同步复制机制的稳定支持的性能问题,我们提出了一种针对分布式环境中这种持久状态检查点技术进行优化的故障恢复方案。对kDFS分布式文件系统实现的首次评估表明,我们的建议对性能的影响可以忽略不计。
{"title":"Handling Persistent States in Process Checkpoint/Restart Mechanisms for HPC Systems","authors":"Pierre Riteau, A. Lèbre, C. Morin","doi":"10.1109/CCGRID.2009.29","DOIUrl":"https://doi.org/10.1109/CCGRID.2009.29","url":null,"abstract":"Computer clusters are today the reference architecture for high-performance computing. The large number of nodes in these systems induces a high failure rate. This makes fault tolerance mechanisms, e.g. process checkpoint/restart, a required technology to effectively exploit clusters. Most of the process checkpoint/restart implementations only handle volatile states and do not take into account persistent states of applications, which can lead to incoherent application restarts. In this paper, we introduce an efficient persistent state checkpoint/restoration approach that can be interconnected with a large number of file systems. To avoid the performance issues of a stable support relying on synchronous replication mechanisms, we present a failure resilience scheme optimized for such persistent state checkpointing techniques in a distributed environment. First evaluations of our implementation in the kDFS distributed file system show the negligible performance impact of our proposal.","PeriodicalId":118263,"journal":{"name":"2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128483647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Hierarchical Caches for Grid Workflows 网格工作流的分层缓存
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.10
David Chiu, G. Agrawal
From personal software to advanced systems, caching mechanisms have steadfastly been a ubiquitous means for reducing workloads. It is no surprise, then, that under the grid and cluster paradigms, middlewares and other large-scale applications often seek caching solutions. Among these distributed applications, scientific workflow management systems have gained ground towards mitigating the often painstaking process of composing sequences of scientific data sets and services to derive virtual data. In the past, workflow managers have relied on low-level system cache for reuse support. But in distributed query intensive environments, where high volumes of intermediate virtual data can potentially be stored anywhere on the grid, a novel cache structure is needed to efficiently facilitate workflow planning. In this paper, we describe an approach to combat the challenges of maintaining large, fast virtual data caches for workflow composition. A hierarchical structure is proposed for indexing scientific data with spatiotemporal annotations across grid nodes. Our experimental results show that our hierarchical index is scalable and outperforms a centralized indexing scheme by an exponential factor in query intensive environments.
从个人软件到高级系统,缓存机制一直是减少工作负载的普遍手段。因此,在网格和集群范例下,中间件和其他大规模应用程序经常寻求缓存解决方案也就不足为奇了。在这些分布式应用程序中,科学工作流管理系统在减轻组合科学数据集序列和服务以派生虚拟数据的通常艰苦的过程方面取得了进展。在过去,工作流管理器依赖于低级系统缓存来支持重用。但是在分布式查询密集型环境中,大量中间虚拟数据可能存储在网格的任何位置,因此需要一种新的缓存结构来有效地促进工作流规划。在本文中,我们描述了一种方法来应对维护工作流组成的大型、快速虚拟数据缓存的挑战。提出了一种利用网格节点间的时空注释对科学数据进行索引的分层结构。实验结果表明,我们的分级索引具有可扩展性,并且在查询密集型环境中优于集中式索引方案。
{"title":"Hierarchical Caches for Grid Workflows","authors":"David Chiu, G. Agrawal","doi":"10.1109/CCGRID.2009.10","DOIUrl":"https://doi.org/10.1109/CCGRID.2009.10","url":null,"abstract":"From personal software to advanced systems, caching mechanisms have steadfastly been a ubiquitous means for reducing workloads. It is no surprise, then, that under the grid and cluster paradigms, middlewares and other large-scale applications often seek caching solutions. Among these distributed applications, scientific workflow management systems have gained ground towards mitigating the often painstaking process of composing sequences of scientific data sets and services to derive virtual data. In the past, workflow managers have relied on low-level system cache for reuse support. But in distributed query intensive environments, where high volumes of intermediate virtual data can potentially be stored anywhere on the grid, a novel cache structure is needed to efficiently facilitate workflow planning. In this paper, we describe an approach to combat the challenges of maintaining large, fast virtual data caches for workflow composition. A hierarchical structure is proposed for indexing scientific data with spatiotemporal annotations across grid nodes. Our experimental results show that our hierarchical index is scalable and outperforms a centralized indexing scheme by an exponential factor in query intensive environments.","PeriodicalId":118263,"journal":{"name":"2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126804306","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Self-Tuning Virtual Machines for Predictable eScience 面向可预测科学的自调优虚拟机
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.84
Sang-Min Park, M. Humphrey
Unpredictable access to batch-mode HPC resources is a significant problem for emerging dynamic data-driven applications. Although efforts such as reservation or queue-time prediction have attempted to partially address this problem, the approaches strictly based on space-sharing impose fundamental limits on real-time predictability. In contrast, our earlier work investigated the use of feedback-controlled virtual machines (VMs), a time-sharing approach, to deliver predictable execution. However, our earlier work did not fully address usability and implementation efficiency. This paper presents an online, software-only version of feedback controlled VM, called self-tuning VM, which we argue is a practical approach for predictable HPC infrastructure. Our evaluation using five widely-used applications show our approach is both predictable and practical: by simply running time-dependent jobs with our tool, we meet a job’s deadline typically within 3% errors, and within 8% errors for the more challenging applications.
对批处理模式HPC资源的不可预测访问是新兴的动态数据驱动应用程序的一个重要问题。尽管诸如预订或排队时间预测之类的努力试图部分地解决这个问题,但严格基于空间共享的方法对实时可预测性施加了根本性的限制。相比之下,我们早期的工作调查了反馈控制的虚拟机(vm)的使用,一种分时方法,以提供可预测的执行。然而,我们早期的工作并没有完全解决可用性和实现效率问题。本文提出了一种在线的、仅软件版本的反馈控制VM,称为自调优VM,我们认为这是一种可预测的HPC基础设施的实用方法。我们对五个广泛使用的应用程序进行了评估,结果表明我们的方法既可预测又实用:通过简单地使用我们的工具运行与时间相关的作业,我们通常在3%的错误内完成作业的截止日期,对于更具挑战性的应用程序,我们的错误在8%以内。
{"title":"Self-Tuning Virtual Machines for Predictable eScience","authors":"Sang-Min Park, M. Humphrey","doi":"10.1109/CCGRID.2009.84","DOIUrl":"https://doi.org/10.1109/CCGRID.2009.84","url":null,"abstract":"Unpredictable access to batch-mode HPC resources is a significant problem for emerging dynamic data-driven applications. Although efforts such as reservation or queue-time prediction have attempted to partially address this problem, the approaches strictly based on space-sharing impose fundamental limits on real-time predictability. In contrast, our earlier work investigated the use of feedback-controlled virtual machines (VMs), a time-sharing approach, to deliver predictable execution. However, our earlier work did not fully address usability and implementation efficiency. This paper presents an online, software-only version of feedback controlled VM, called self-tuning VM, which we argue is a practical approach for predictable HPC infrastructure. Our evaluation using five widely-used applications show our approach is both predictable and practical: by simply running time-dependent jobs with our tool, we meet a job’s deadline typically within 3% errors, and within 8% errors for the more challenging applications.","PeriodicalId":118263,"journal":{"name":"2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129225704","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 47
GenLM: License Management for Grid and Cloud Computing Environments GenLM:网格和云计算环境的许可证管理
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.31
M. Dalheimer, F. Pfreundt
Software license management allows independent software vendors (ISVs) to  control the access of their products. It is a fundamental part of the ISVs' business strategy. A wide range of products has been developed in order to address license management. There are, however, only few ongoing works with regard to license management in grid and cloud  computing environments. This paper presents our work on GenLM, a license management solution suitable for these environments. It has been built in order to provide a secure and robust solution for ISVs that want to extend their software usage to these systems. We provide ISVs a toolchain to implement arbitrary software licensing models. At the same time we ensure that licenses are mobile, i.e. they can be used on any resource the user has access to.
软件许可证管理允许独立软件供应商(isv)控制对其产品的访问。它是独立软件开发商业务战略的基本组成部分。为了解决许可证管理问题,已经开发了各种各样的产品。然而,在网格和云计算环境中,只有少数正在进行的关于许可证管理的工作。本文介绍了我们在GenLM上的工作,这是一个适合这些环境的许可证管理解决方案。它的构建是为了为那些希望将其软件使用扩展到这些系统的isv提供一个安全和健壮的解决方案。我们为isv提供了一个工具链来实现任意软件许可模型。同时,我们确保许可证是可移动的,也就是说,它们可以用于用户可以访问的任何资源。
{"title":"GenLM: License Management for Grid and Cloud Computing Environments","authors":"M. Dalheimer, F. Pfreundt","doi":"10.1109/CCGRID.2009.31","DOIUrl":"https://doi.org/10.1109/CCGRID.2009.31","url":null,"abstract":"Software license management allows independent software vendors (ISVs) to  control the access of their products. It is a fundamental part of the ISVs' business strategy. A wide range of products has been developed in order to address license management. There are, however, only few ongoing works with regard to license management in grid and cloud  computing environments. This paper presents our work on GenLM, a license management solution suitable for these environments. It has been built in order to provide a secure and robust solution for ISVs that want to extend their software usage to these systems. We provide ISVs a toolchain to implement arbitrary software licensing models. At the same time we ensure that licenses are mobile, i.e. they can be used on any resource the user has access to.","PeriodicalId":118263,"journal":{"name":"2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","volume":"843 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117332197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
File Clustering Based Replication Algorithm in a Grid Environment 网格环境下基于文件聚类的复制算法
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.73
Hitoshi Sato, S. Matsuoka, Toshio Endo
Replication in grid file systems can significantly improve I/O performance of data-intensive applications. However, most of existing replication techniques apply to individual files, which may introduce inefficient replication overheads for a large number of files. We propose a file clustering based replication algorithm for grid file systems. Our algorithm groups files according to a relationship of simultaneous accesses between files and stores replicas of the clustered files into storage nodes, to satisfy expected most of future read access times to the clustered files and replication times for individual files being minimized under the given storage capacity limitation. Our experiments on a given grid environment, 20 nodes of 5 sites, suggest that the proposed algorithm achieves accurate file clustering and efficient replica management; our clustering policy with the file cluster size limit of 5120 MB and the storage capacity limit for replicas of 10240 MB exhibits 1.58 times efficiency than the policy that never groups related files. The results also indicate that the overheads required for introducing our algorithm significantly affect I/O performance of running applications.
网格文件系统中的复制可以显著提高数据密集型应用程序的I/O性能。但是,大多数现有的复制技术都适用于单个文件,这可能会为大量文件带来低效的复制开销。提出了一种基于文件集群的网格文件系统复制算法。我们的算法根据文件之间的同时访问关系对文件进行分组,并将集群文件的副本存储到存储节点中,以满足在给定存储容量限制下期望的对集群文件的大部分未来读访问次数和单个文件的复制次数最小化。在给定网格环境下,5个站点20个节点的实验表明,该算法实现了准确的文件聚类和高效的副本管理;文件集群大小限制为5120 MB,副本存储容量限制为10240 MB的集群策略的效率是不分组相关文件的策略的1.58倍。结果还表明,引入我们的算法所需的开销会显著影响正在运行的应用程序的I/O性能。
{"title":"File Clustering Based Replication Algorithm in a Grid Environment","authors":"Hitoshi Sato, S. Matsuoka, Toshio Endo","doi":"10.1109/CCGRID.2009.73","DOIUrl":"https://doi.org/10.1109/CCGRID.2009.73","url":null,"abstract":"Replication in grid file systems can significantly improve I/O performance of data-intensive applications. However, most of existing replication techniques apply to individual files, which may introduce inefficient replication overheads for a large number of files. We propose a file clustering based replication algorithm for grid file systems. Our algorithm groups files according to a relationship of simultaneous accesses between files and stores replicas of the clustered files into storage nodes, to satisfy expected most of future read access times to the clustered files and replication times for individual files being minimized under the given storage capacity limitation. Our experiments on a given grid environment, 20 nodes of 5 sites, suggest that the proposed algorithm achieves accurate file clustering and efficient replica management; our clustering policy with the file cluster size limit of 5120 MB and the storage capacity limit for replicas of 10240 MB exhibits 1.58 times efficiency than the policy that never groups related files. The results also indicate that the overheads required for introducing our algorithm significantly affect I/O performance of running applications.","PeriodicalId":118263,"journal":{"name":"2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid","volume":"53 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2009-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126730950","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
期刊
2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1