
2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid: Latest Publications

Extending Pastry by an Alphanumerical Overlay
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.65
Dominic Battré, A. Höing, Martin Raack, Ulf Rerrer-Brusch, O. Kao
Many load balancing strategies have been proposed for distributed hash tables (DHTs) such as Pastry. These strategies assume that hash functions spread even skewed key distributions almost evenly over the ID space. They neglect the problem that many applications produce data with common keys (multi-sets), which entail hash collisions and thereby imbalance in both query and storage load. A second drawback of using hash functions in DHTs is the lack of range queries, which are needed in many scenarios. This paper presents a way to use the routing structure of the P2P network Pastry to create a new alphanumerical overlay at very little additional cost. This overlay is capable of storing data in a totally ordered manner instead of using hashed keys. It thereby enables range queries and sophisticated load balancing. We discuss the impact on Pastry that arises when nodes are relocated during load balancing, which may cause a skewed distribution of nodes in the circular ID space. We demonstrate the feasibility of our idea, including its advantages and problems, through an evaluation of simulations.
Citations: 5
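The contrast drawn in the abstract above, hashed keys versus totally ordered storage, can be illustrated with a small sketch. This is not the paper's overlay or Pastry's API: `hashed_position` and `OrderedStore` are hypothetical names, and a single sorted list stands in for data that a real overlay would spread across nodes. The sketch only shows why ordered keys make range queries straightforward, while identical keys always hash to the same position.

```python
import bisect
import hashlib


def hashed_position(key: str) -> int:
    """Map a key to a position in a 2**32 ID space via SHA-1 (illustrative only)."""
    return int.from_bytes(hashlib.sha1(key.encode()).digest()[:4], "big")


class OrderedStore:
    """Hypothetical store that keeps keys sorted, so a key range is contiguous."""

    def __init__(self):
        self._keys = []

    def insert(self, key: str) -> None:
        # insort keeps the list sorted after every insertion.
        bisect.insort(self._keys, key)

    def range_query(self, low: str, high: str) -> list:
        """Return all keys k with low <= k <= high."""
        lo = bisect.bisect_left(self._keys, low)
        hi = bisect.bisect_right(self._keys, high)
        return self._keys[lo:hi]


store = OrderedStore()
for k in ["apple", "apricot", "banana", "cherry", "avocado"]:
    store.insert(k)

# Ordered storage answers a range query with one contiguous slice.
in_range = store.range_query("ap", "az")

# Identical keys (a multi-set) always land on the same hashed position,
# so hashing cannot spread them over several nodes.
same_spot = hashed_position("apple") == hashed_position("apple")
```

With hashed keys, neighbouring keys such as "apple" and "apricot" generally end up at unrelated positions, so answering the same range query would require contacting every position that could hold a matching key.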
Evaluating the Divisible Load Assumption in the Context of Economic Grid Scheduling with Deadline-Based QoS guarantees
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.36
Wim Depoorter, R. Y. V. Bossche, K. Vanmechelen, J. Broeckhove
The efficient scheduling of jobs is an essential part of any grid resource management system. At its core, it involves finding a solution to a problem which is NP-complete by reduction to the knapsack problem. Consequently, this problem is often tackled by using heuristics to derive a more pragmatic solution. Other than the use of heuristics, simplifications and abstractions of the workload model may also be employed to increase the tractability of the scheduling problem. A possible abstraction in this context is the use of Divisible Load Theory (DLT), in which it is assumed that an application consists of an arbitrarily divisible load (ADL). Many applications, however, are composed of a number of atomic tasks and are only modularly divisible. In this paper we evaluate the consequences of the ADL assumption on the performance of economic scheduling approaches for grids, in the context of CPU-bound modularly divisible applications with hard deadlines. Our goal is to evaluate to what extent DLT can still serve as a useful workload abstraction for obtaining tractable scheduling algorithms in this setting. The focus of our evaluation is on the recently proposed tsfGrid heuristic for economic scheduling of grid workloads, which operates under the assumptions of ADL. We demonstrate the effect of the ADL assumption on the actual instantiation of schedules and on the user value realized by the RMS. In addition, we describe how the usage of a DLT heuristic in a high-level admission controller for a mechanism which does take into account the atomicity of individual tasks can significantly reduce communication and computational overhead.
Citations: 7
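A toy calculation can show why the arbitrarily divisible load (ADL) assumption in the abstract above can be optimistic for modularly divisible applications. The sketch assumes identical processors and equal-sized atomic tasks; the function names and all numbers are illustrative and are not taken from the paper or from tsfGrid.

```python
import math


def adl_makespan(total_work: float, processors: int) -> float:
    """Completion time if the load can be split into arbitrarily small pieces."""
    return total_work / processors


def modular_makespan(num_tasks: int, task_time: float, processors: int) -> float:
    """Completion time when the load consists of equal-sized atomic tasks."""
    # Some processor must run ceil(num_tasks / processors) whole tasks.
    return math.ceil(num_tasks / processors) * task_time


num_tasks, task_time, processors = 10, 3.0, 4
deadline = 8.0

ideal = adl_makespan(num_tasks * task_time, processors)      # 30 / 4
actual = modular_makespan(num_tasks, task_time, processors)  # 3 rounds of 3.0

adl_meets_deadline = ideal <= deadline
atomic_meets_deadline = actual <= deadline
```

In this example the ADL estimate suggests the hard deadline can be met, while a schedule built from atomic tasks misses it. The gap shrinks as the number of tasks per processor grows.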
A Novel Collaborative Grid Framework for Distributed Healthcare
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.74
Hoang M. Phung, D. Hoang, E. Lawrence
The increasing ageing population around the world presents an unprecedented challenge in developing economically viable healthcare systems for our societies. To overcome this challenge and maintain a decent level of healthcare, further innovation and deployment of information technologies in the field of healthcare are the keys. Our earlier work puts forward "An Active Grid Infrastructure for Elderly Care" solution whereby elderly people are monitored non-intrusively to provide essential health information about them; the information concerning their condition is kept in an "active" health record; and resources involved in the system can be shared by multiple caregivers or organizations allowing them to collaborate in developing relevant care programs for these elderly persons. This paper focuses on the design of the collaborative component of the solution. A framework is proposed for an integrated Grid system that supports collaborative task planning and workflow development. The system will enable healthcare professionals to access the necessary resources, collaborate with each other to develop and execute treatment workflows in a user-friendly and intuitive manner. The paper will discuss the requirements and design principles for key components of the system and present its preliminary implementation.
Citations: 5
Minimizing Energy Consumption for Precedence-Constrained Applications Using Dynamic Voltage Scaling
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.16
Young Choon Lee, Albert Y. Zomaya
Jobs on high-performance computing systems are deployed mostly with the sole goal of minimizing completion times. This performance demand has been satisfied without paying much attention to power/energy consumption. Consequently, energy consumption has become a major concern in high-performance computing systems. In this paper, we address the problem of scheduling precedence-constrained parallel applications on such systems, specifically those with heterogeneous resources, accounting for both application completion time and energy consumption. Our scheduling algorithm adopts dynamic voltage scaling (DVS) to minimize energy consumption. DVS can be used with a number of recent commodity processors that can operate at different supply voltage levels at the expense of clock frequency. In the context of scheduling, this multiple voltage facility implies that there is a trade-off between the quality of schedules and energy consumption. Our algorithm effectively balances these two performance goals using a novel objective function, which takes into account both goals; this claim is verified by the results obtained from our extensive comparative evaluation study.
Citations: 172
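The trade-off between schedule quality and energy mentioned in the abstract above can be sketched with the common CMOS approximation that dynamic power is proportional to C·V²·f. This is not the paper's objective function, which the abstract does not specify. The voltage/frequency pairs and the cycle count below are hypothetical values chosen for illustration.

```python
def task_time(cycles: float, frequency_hz: float) -> float:
    """Execution time of a task needing `cycles` clock cycles at a given frequency."""
    return cycles / frequency_hz


def task_energy(cycles: float, voltage: float, capacitance: float = 1.0) -> float:
    """Dynamic energy under the model P = C * V^2 * f.

    With t = cycles / f, the energy E = P * t simplifies to C * V^2 * cycles,
    so lowering the supply voltage lowers energy even though the task runs longer.
    """
    return capacitance * voltage ** 2 * cycles


# Hypothetical (supply voltage in V, clock frequency in Hz) operating points.
levels = [(1.5, 2.0e9), (1.2, 1.5e9), (0.9, 1.0e9)]
cycles = 3.0e9

profile = [(v, task_time(cycles, f), task_energy(cycles, v)) for v, f in levels]
times = [t for _, t, _ in profile]
energies = [e for _, _, e in profile]
```

Moving down the list, each operating point takes longer but consumes less energy, which is the tension a scheduler using DVS has to balance against completion time.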
Reliable P2P Feed Delivery
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.32
Anwitaman Datta, Xin Liu
Using peer-to-peer overlays to notify users whenever a new update occurs is a promising approach to support web-based publish-subscribe systems like Really Simple Syndication (RSS). Such a peer-to-peer approach can scale well by reducing load at the source and also guarantee timeliness of notifications. Several such overlay-based approaches have been proposed in recent years. However, malicious peers may pretend to relay updates without actually doing so, and thus deny service, or even propagate counterfeit updates, thus rendering a peer-to-peer mechanism not only useless but even harmful (e.g., through false updates). We propose overlay-independent randomized strategies to mitigate these ill effects of malicious peers at a marginal overhead, thus enjoying the benefits of peer-to-peer dissemination along with the assurance of content integrity in RSS-like web-based publish-subscribe applications, without altering currently deployed server infrastructure.
Citations: 1
Markov Model Based Disk Power Management for Data Intensive Workloads
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.67
R. Garg, S. Son, M. Kandemir, P. Raghavan, R. Prabhakar
In order to meet the increasing demands of present and upcoming data-intensive computer applications, there has been a major shift in the disk subsystem, which now consists of more disks with higher storage capacities and higher rotational speeds. These have made the disk subsystem a major consumer of power, making disk power management an important issue. People have considered the option of spinning down the disk during periods of idleness or serving the requests at lower rotational speeds when performance is not an issue. Accurately predicting future disk idle periods is crucial to such schemes. This paper presents a novel disk-idleness prediction mechanism based on Markov models and explains how this mechanism can be used in conjunction with a three-speed disk. Our experimental evaluation using a diverse set of workloads indicates that (i) prediction accuracies achieved by the proposed scheme are very good (87.5% on average); (ii) it generates significant energy savings over the traditional power-saving method of spinning down the disk when idle (35.5% on average); (iii) it performs better than a previously proposed multi-speed disk management scheme (19% on average); and (iv) the performance penalty is negligible (less than 1% on average). Overall, our implementation and experimental evaluation using both synthetic disk traces and traces extracted from real applications demonstrate the feasibility of a Markov-model-based approach to saving disk power.
Citations: 27
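As a rough illustration of Markov-model-based idleness prediction, the sketch below fits a first-order Markov chain over discretized idle-period classes and predicts the most likely next class. The abstract does not describe the paper's actual states, model order, or training procedure, so the class labels "short" and "long", the history, and both function names are assumptions made for this example.

```python
from collections import Counter, defaultdict


def fit_transitions(states):
    """Count how often each idle-period class is followed by each other class."""
    counts = defaultdict(Counter)
    for current, following in zip(states, states[1:]):
        counts[current][following] += 1
    return counts


def predict_next(counts, current):
    """Return the most frequently observed successor of `current`, or None if unseen."""
    if current not in counts:
        return None
    return counts[current].most_common(1)[0][0]


# Hypothetical history of observed idle periods, already bucketed into classes.
history = ["short", "long", "long", "short", "long", "long", "short", "long", "long"]
model = fit_transitions(history)

after_short = predict_next(model, "short")
after_long = predict_next(model, "long")
```

A power manager could use such a prediction to pick a rotational speed for the upcoming idle period, for example staying at full speed when a short idle period is expected. How predictions map onto the three disk speeds is not specified in the abstract.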
Performance under Failures of DAG-based Parallel Computing
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.55
Hui Jin, Xian-He Sun, Ziming Zheng, Z. Lan, Bing Xie
As the scale and complexity of parallel systems continue to grow, failures become more and more an inevitable fact for solving large-scale applications. In this research, we present an analytical study to estimate the execution time of directed acyclic graph (DAG) based scientific applications in the presence of failures and provide a guideline for performance optimization. The study is fourfold. We first introduce a performance model to predict individual subtask computation time under failures. Next, a layered, iterative approach is adopted to transform a DAG into a layered DAG, which reflects full dependencies among all the subtasks. Then, the expected execution time under failures of the DAG is derived based on stochastic analysis. Unlike existing models, this newly proposed performance model provides both the variance and distribution. It is practical and can be put to real use. Finally, based on the model, performance optimization, weak point identification and enhancement are proposed. Intensive simulations with real system traces are conducted to verify the analytical findings. They show that the newly proposed model and weak point enhancement mechanism work well.
Citations: 16
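One simple way to arrange a DAG into layers is to place every subtask one layer below its deepest predecessor. The sketch below does this with Python's standard `graphlib` module (Python 3.9 or later). It is a generic longest-path layering and is not claimed to be the paper's layered, iterative transformation, whose details the abstract does not give. The example graph and the name `layer_dag` are hypothetical.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def layer_dag(deps):
    """Assign each subtask a layer index.

    `deps` maps a subtask to the set of subtasks it depends on.
    Subtasks without dependencies get layer 0; every other subtask is
    placed one layer below its deepest predecessor.
    """
    layer = {}
    # static_order() yields each node only after all of its predecessors.
    for node in TopologicalSorter(deps).static_order():
        preds = deps.get(node, ())
        layer[node] = 1 + max((layer[p] for p in preds), default=-1)
    return layer


# Hypothetical five-subtask application.
deps = {"B": {"A"}, "C": {"A"}, "D": {"B", "C"}, "E": {"A", "D"}}
layers = layer_dag(deps)

# Group subtasks by layer; subtasks in the same layer do not depend on each other.
by_layer = {}
for task, depth in layers.items():
    by_layer.setdefault(depth, set()).add(task)
```

Once subtasks are grouped this way, a per-layer completion-time estimate can be combined layer by layer, which is the kind of structure an analysis of expected execution time under failures can build on.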
Grid-Based Sleep Research: Analysis of Polysomnographies Using a Grid Infrastructure
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.52
D. Krefting, S. Canisius, A. Hoheisel, H. Loose, T. Tolxdorff, T. Penzel
The analysis of biosignals, such as the electroencephalogram (EEG) or the electrocardiogram (ECG), is essential for diagnosis in many medical areas, in particular sleep medicine and sleep research. A standard method in this field is polysomnography, a multidimensional biosignal recording during the whole bedtime phase. Within the SIESTA project, a European multicenter study, comprehensive clinical and polysomnographic records from over 300 persons have been collected. To make the data available to researchers as a reference for clinical research and the development of new analysis tools, the SIESTA database is implemented in a grid infrastructure. To date, the complete data is stored in the grid and different algorithms for automated ECG analysis are implemented. The database can be queried and the matching data can be analysed on record level and collection level. The application is modelled as a workflow and integrated into the grid using a workflow manager. A graphical user interface is implemented as a grid portlet. It allows the initialization of new computation tasks as well as the monitoring of, and result retrieval from, already launched analyses.
Citations: 10
Market-Oriented Cloud Computing: Vision, Hype, and Reality of Delivering Computing as the 5th Utility
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.97
R. Buyya
Computing is being transformed to a model consisting of services that are commoditised and delivered in a manner similar to utilities such as water, electricity, gas, and telephony. In such a model, users access services based on their requirements without regard to where the services are hosted. Several computing paradigms have promised to deliver this utility computing vision and they include Grid computing, P2P computing, and more recently Cloud computing. The latter term denotes the infrastructure as a “Cloud” in which businesses and users are able to access applications from anywhere in the world on demand. Hence, Cloud computing can be classed as a new paradigm for the dynamic creation of next-generation Data Centers by assembling services of networked Virtual Machines (VMs). Thus, the computing world is rapidly transforming towards developing software for millions to consume as a service rather than creating software for millions to run on their PCs.
Citations: 236
Scheduling Strategies for Cycle Scavenging in Multicluster Grid Systems
Pub Date : 2009-05-18 DOI: 10.1109/CCGRID.2009.46
Omer Ozan Sonmez, B. Grundeken, H. Mohamed, A. Iosup, D. Epema
The use of today's multicluster grids exhibits periods of submission bursts with periods of normal use and even of idleness. To avoid resource contention, many users employ observational scheduling, that is, they postpone the submission of relatively low-priority jobs until a cluster becomes (largely) idle. However, observational scheduling leads to resource contention when several such users crowd the same idle cluster. Moreover, this job execution model either delays the execution of more important jobs, or requires extensive administrative support for job and user priorities. Instead, in this work we investigate the use of cycle scavenging to run jobs on grid resources politely yet efficiently, and with an acceptable administrative cost. We design a two-level cycle scavenging scheduling architecture that runs unobtrusively alongside regular grid scheduling. We equip this scheduler with two novel cycle scavenging scheduling policies that enforce fair resource sharing among competing cycle scavenging users. We show through experiments with real and synthetic applications in a real multicluster grid that the proposed architecture can execute jobs politely yet efficiently.
Citations: 22