首页 > 最新文献

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing最新文献

英文 中文
Availability Prediction Based Replication Strategies for Grid Environments 网格环境中基于可用性预测的复制策略
Brent Rood, M. Lewis
Volunteer-based grid computing resources are characteristically volatile and frequently become unavailable due to the autonomy that owners maintain over them. This resource volatility has significant influence on the applications the resources host. Availability predictors can forecast unavailability, and can provide schedulers with information about reliability, which helps them make better scheduling decisions when combined with information about speed and load. This paper studies using this prediction information for deciding when to replicate jobs. In particular, our predictors forecast the probability that a job will complete uninterrupted, and our schedulers replicate those jobs that are least likely to do so. Our strategies outperform other comparable replication strategies, as measured by improved make span and fewer redundant operations. We define a new ``replication efficiency" metric, and demonstrate that our availability predictor can provide information that allows our schedulers to be more efficient than the most closely related replication strategy for a variety of loads in a trace-based grid simulation. We demonstrate that under low load conditions, our techniques come within 6% of the makespan improvement of a previously proposed replication technique while creating 76.8% fewer replicas and under higher loads, can improve makespan marginally while creating 72.5% fewer replicas.
基于志愿者的网格计算资源的特点是不稳定的,并且由于所有者对它们的自主权而经常变得不可用。这种资源的波动性对资源所承载的应用程序有很大的影响。可用性预测器可以预测不可用性,并且可以为调度器提供有关可靠性的信息,当与有关速度和负载的信息结合使用时,这些信息有助于调度器做出更好的调度决策。本文研究使用这些预测信息来决定何时复制作业。特别是,我们的预测器预测作业不间断完成的概率,我们的调度器复制那些最不可能完成的作业。通过改进的make span和更少的冗余操作来衡量,我们的策略优于其他类似的复制策略。我们定义了一个新的“复制效率”指标,并演示了我们的可用性预测器可以提供的信息,使我们的调度器在基于跟踪的网格模拟中的各种负载上比最密切相关的复制策略更有效。我们证明,在低负载条件下,我们的技术比之前提出的复制技术改进的最大时间跨度不到6%,同时创建的副本减少了76.8%;在高负载条件下,可以略微提高最大时间跨度,同时创建的副本减少了72.5%。
{"title":"Availability Prediction Based Replication Strategies for Grid Environments","authors":"Brent Rood, M. Lewis","doi":"10.1109/CCGRID.2010.121","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.121","url":null,"abstract":"Volunteer-based grid computing resources are characteristically volatile and frequently become unavailable due to the autonomy that owners maintain over them. This resource volatility has significant influence on the applications the resources host. Availability predictors can forecast unavailability, and can provide schedulers with information about reliability, which helps them make better scheduling decisions when combined with information about speed and load. This paper studies using this prediction information for deciding when to replicate jobs. In particular, our predictors forecast the probability that a job will complete uninterrupted, and our schedulers replicate those jobs that are least likely to do so. Our strategies outperform other comparable replication strategies, as measured by improved make span and fewer redundant operations. We define a new ``replication efficiency\" metric, and demonstrate that our availability predictor can provide information that allows our schedulers to be more efficient than the most closely related replication strategy for a variety of loads in a trace-based grid simulation. We demonstrate that under low load conditions, our techniques come within 6% of the makespan improvement of a previously proposed replication technique while creating 76.8% fewer replicas and under higher loads, can improve makespan marginally while creating 72.5% fewer replicas.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134240623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
TOPP goes Rapid The OpenMS Proteomics Pipeline in a Grid-Enabled Web Portal TOPP走得很快:OpenMS蛋白质组学管道在一个网格支持的Web门户
S. Gesing, Jano van Hemert, J. Koetsier, A. Bertsch, O. Kohlbacher
Proteomics, the study of all the proteins contained in a particular sample, e.g., a cell, is a key technology in current biomedical research. The complexity and volume of proteomics data sets produced by mass spectrometric methods clearly suggests the use of grid-based high-performance computing for analysis. TOPP and OpenMS are open-source packages for proteomics data analysis, however, they do not provide support for Grid computing. In this work we present a portal interface for high-throughput data analysis with TOPP. The portal is based on Rapid, a tool for efficiently generating standardized port lets for a wide range of applications. The web-based interface allows the creation and editing of user-defined pipelines and their execution and monitoring on a Grid infrastructure. The portal also supports several file transfer protocols for data staging. It thus provides a simple and complete solution to high-throughput proteomics data analysis for inexperienced users through a convenient portal interface.
蛋白质组学是对特定样品(如细胞)中包含的所有蛋白质的研究,是当前生物医学研究中的一项关键技术。质谱方法产生的蛋白质组学数据集的复杂性和体积清楚地表明使用基于网格的高性能计算进行分析。TOPP和OpenMS是用于蛋白质组学数据分析的开源软件包,但是它们不支持网格计算。在这项工作中,我们提出了一个使用TOPP进行高吞吐量数据分析的门户接口。该门户基于Rapid,这是一种有效地为各种应用程序生成标准化端口的工具。基于web的接口允许创建和编辑用户定义的管道,以及在网格基础设施上执行和监视管道。门户还支持用于数据暂存的几种文件传输协议。因此,它通过一个方便的门户界面为没有经验的用户提供了一个简单而完整的高通量蛋白质组学数据分析解决方案。
{"title":"TOPP goes Rapid The OpenMS Proteomics Pipeline in a Grid-Enabled Web Portal","authors":"S. Gesing, Jano van Hemert, J. Koetsier, A. Bertsch, O. Kohlbacher","doi":"10.1109/CCGRID.2010.39","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.39","url":null,"abstract":"Proteomics, the study of all the proteins contained in a particular sample, e.g., a cell, is a key technology in current biomedical research. The complexity and volume of proteomics data sets produced by mass spectrometric methods clearly suggests the use of grid-based high-performance computing for analysis. TOPP and OpenMS are open-source packages for proteomics data analysis, however, they do not provide support for Grid computing. In this work we present a portal interface for high-throughput data analysis with TOPP. The portal is based on Rapid, a tool for efficiently generating standardized port lets for a wide range of applications. The web-based interface allows the creation and editing of user-defined pipelines and their execution and monitoring on a Grid infrastructure. The portal also supports several file transfer protocols for data staging. It thus provides a simple and complete solution to high-throughput proteomics data analysis for inexperienced users through a convenient portal interface.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132885412","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Effects of Untruthful Bids on User Utilities and Stability in Computing Markets 不诚实出价对用户效用和计算市场稳定性的影响
Sergei Shudler, Lior Amar, A. Barak, Ahuva Mu'alem
Markets of computing resources typically consist of a cluster (or a multi-cluster) and jobs that arrive over time and request computing resources in exchange for payment. In this paper we study a real system that is capable of preemptive process migration (i.e. moving jobs across nodes) and that uses a market-based resource allocation mechanism for job allocation. Specifically, we formalize our system into a market model and employ simulation-based analysis (performed on real data) to study the effects of users' behavior on performance and utility. Typically online settings are characterized by a large amount of uncertainty, therefore it is reasonable to assume that users will consider simple strategies to game the system. We thus suggest a novel approach to modeling users' behavior called the Small Risk-aggressive Group model. We show that under this model untruthful users experience degraded performance. The main result and the contribution of this paper is that using the k-th price payment scheme, which is a natural adaptation of the classical second-price scheme, discourages these users from attempting to game the market. The preemptive capability makes it possible not only to use the k-th price scheme, but also makes our scheduling algorithm superior to other non-preemptive algorithms. Finally, we design a simple one-shot game to model the interaction between the provider and the consumers. We then show (using the same simulation-based analysis) that market stability in the form of (symmetric) Nash-equilibrium is likely to be achieved in several cases.
计算资源市场通常由一个集群(或多集群)和作业组成,这些作业随着时间的推移到达,并请求计算资源以换取报酬。在本文中,我们研究了一个真实的系统,它能够先发制人的进程迁移(即跨节点移动作业),并使用基于市场的资源分配机制进行作业分配。具体来说,我们将我们的系统形式化为一个市场模型,并采用基于仿真的分析(在真实数据上执行)来研究用户行为对性能和效用的影响。典型的在线设置具有大量不确定性的特点,因此可以合理地假设用户将考虑简单的策略来与系统博弈。因此,我们提出了一种新的方法来模拟用户的行为,称为小风险-攻击性群体模型。我们表明,在这个模型下,不诚实的用户会经历性能下降。本文的主要结果和贡献是,使用第k个价格支付方案,这是经典的第二价格方案的自然适应,阻止了这些用户试图博弈市场。抢占性使得调度算法不仅可以使用第k个价格方案,而且优于其他非抢占性调度算法。最后,我们设计了一个简单的一次性游戏来模拟提供者和消费者之间的交互。然后,我们(使用相同的基于模拟的分析)表明,在几种情况下,(对称的)纳什均衡形式的市场稳定可能会实现。
{"title":"The Effects of Untruthful Bids on User Utilities and Stability in Computing Markets","authors":"Sergei Shudler, Lior Amar, A. Barak, Ahuva Mu'alem","doi":"10.1109/CCGRID.2010.57","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.57","url":null,"abstract":"Markets of computing resources typically consist of a cluster (or a multi-cluster) and jobs that arrive over time and request computing resources in exchange for payment. In this paper we study a real system that is capable of preemptive process migration (i.e. moving jobs across nodes) and that uses a market-based resource allocation mechanism for job allocation. Specifically, we formalize our system into a market model and employ simulation-based analysis (performed on real data) to study the effects of users' behavior on performance and utility. Typically online settings are characterized by a large amount of uncertainty, therefore it is reasonable to assume that users will consider simple strategies to game the system. We thus suggest a novel approach to modeling users' behavior called the Small Risk-aggressive Group model. We show that under this model untruthful users experience degraded performance. The main result and the contribution of this paper is that using the k-th price payment scheme, which is a natural adaptation of the classical second-price scheme, discourages these users from attempting to game the market. The preemptive capability makes it possible not only to use the k-th price scheme, but also makes our scheduling algorithm superior to other non-preemptive algorithms. Finally, we design a simple one-shot game to model the interaction between the provider and the consumers. We then show (using the same simulation-based analysis) that market stability in the form of (symmetric) Nash-equilibrium is likely to be achieved in several cases.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134474275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Scalable Communication Trace Compression 可扩展通信跟踪压缩
S. Krishnamoorthy, Khushbu Agarwal
Characterizing the communication behavior of parallel programs through tracing can help understand an application’s characteristics, model its performance, and predict behavior on future systems. However, lossless communication traces can get prohibitively large, causing programmers to resort to variety of other techniques. In this paper, we present a novel approach to lossless communication trace compression. We augment the sequitur compression algorithm to employ it in communication trace compression of parallel programs. We present optimizations to reduce the memory overhead, reduce size of the trace files generated, and enable compression across multiple processes in a parallel program. The evaluation shows improved compression and reduced overhead over other approaches, with up to 3 orders of magnitude improvement for the NAS MG benchmark. We also observe that, unlike existing schemes, the trace files sizes and the memory overhead incurred are less sensitive to, if not independent of, the problem size for the NAS benchmarks.
通过跟踪来描述并行程序的通信行为可以帮助理解应用程序的特征,为其性能建模,并预测未来系统的行为。然而,无损通信跟踪可能会变得非常大,导致程序员求助于各种其他技术。在本文中,我们提出了一种新的无损通信跟踪压缩方法。我们对sequitur压缩算法进行了扩充,将其应用于并行程序的通信跟踪压缩。我们提供的优化可以减少内存开销,减少生成的跟踪文件的大小,并在并行程序中支持跨多个进程的压缩。评估显示,与其他方法相比,压缩得到了改善,开销也减少了,NAS MG基准测试的改进幅度高达3个数量级。我们还观察到,与现有方案不同,跟踪文件大小和所产生的内存开销对NAS基准测试的问题大小不那么敏感(如果不是独立的话)。
{"title":"Scalable Communication Trace Compression","authors":"S. Krishnamoorthy, Khushbu Agarwal","doi":"10.1109/CCGRID.2010.111","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.111","url":null,"abstract":"Characterizing the communication behavior of parallel programs through tracing can help understand an application’s characteristics, model its performance, and predict behavior on future systems. However, lossless communication traces can get prohibitively large, causing programmers to resort to variety of other techniques. In this paper, we present a novel approach to lossless communication trace compression. We augment the sequitur compression algorithm to employ it in communication trace compression of parallel programs. We present optimizations to reduce the memory overhead, reduce size of the trace files generated, and enable compression across multiple processes in a parallel program. The evaluation shows improved compression and reduced overhead over other approaches, with up to 3 orders of magnitude improvement for the NAS MG benchmark. We also observe that, unlike existing schemes, the trace files sizes and the memory overhead incurred are less sensitive to, if not independent of, the problem size for the NAS benchmarks.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"135 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130120586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Discovering Piecewise Linear Models of Grid Workload 网格工作负荷的分段线性模型研究
Tamás Éltetö, C. Germain, P. Bondon, M. Sebag
Despite extensive research focused on enabling QoS for grid users through economic and intelligent resource provisioning, no consensus has emerged on the most promising strategies. On top of intrinsically challenging problems, the complexity and size of data has so far drastically limited the number of comparative experiments. An alternative to experimenting on real, large, and complex data, is to look for well-founded and parsimonious representations. This study is based on exhaustive information about the gLite-monitored jobs from the EGEE grid, representative of a significant fraction of e-science computing activity in Europe. Our main contributions are twofold. First we found that workload models for this grid can consistently be discovered from the real data, and that limiting the range of models to piecewise linear time series models is sufficiently powerful. Second, we present a bootstrapping strategy for building more robust models from the limited samples at hand.
尽管广泛的研究集中在通过经济和智能资源配置为电网用户提供QoS上,但在最有希望的策略上还没有达成共识。除了本质上具有挑战性的问题之外,数据的复杂性和规模迄今为止极大地限制了比较实验的数量。对真实的、大型的、复杂的数据进行实验的另一种选择是寻找有充分根据的、简洁的表示。这项研究是基于EGEE网格中关于glite监测工作的详尽信息,代表了欧洲电子科学计算活动的很大一部分。我们的主要贡献是双重的。首先,我们发现该网格的工作负荷模型可以从实际数据中一致地发现,并且将模型的范围限制为分段线性时间序列模型是足够强大的。其次,我们提出了一种自举策略,用于从手头的有限样本中构建更健壮的模型。
{"title":"Discovering Piecewise Linear Models of Grid Workload","authors":"Tamás Éltetö, C. Germain, P. Bondon, M. Sebag","doi":"10.1109/CCGRID.2010.69","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.69","url":null,"abstract":"Despite extensive research focused on enabling QoS for grid users through economic and intelligent resource provisioning, no consensus has emerged on the most promising strategies. On top of intrinsically challenging problems, the complexity and size of data has so far drastically limited the number of comparative experiments. An alternative to experimenting on real, large, and complex data, is to look for well-founded and parsimonious representations. This study is based on exhaustive information about the gLite-monitored jobs from the EGEE grid, representative of a significant fraction of e-science computing activity in Europe. Our main contributions are twofold. First we found that workload models for this grid can consistently be discovered from the real data, and that limiting the range of models to piecewise linear time series models is sufficiently powerful. Second, we present a bootstrapping strategy for building more robust models from the limited samples at hand.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132184631","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
High Resolution Program Flow Visualization of Hardware Accelerated Hybrid Multi-core Applications 硬件加速混合多核应用程序的高分辨率程序流程可视化
D. Hackenberg, G. Juckeland, H. Brunst
The advent of multi-core processors has made parallel computing techniques mandatory on main stream systems. With the recent rise of hardware accelerators, hybrid parallelism adds yet another dimension of complexity to the process of software development. This article presents a tool for graphical program flow analysis of hardware accelerated parallel programs. It monitors the hybrid program execution to record and visualize many performance relevant events along the way. Representative real-world applications written for both IBM’s Cell processor and NVIDIA’s CUDA API are studied exemplarily. To the best of our knowledge, this approach is the first that visualizes the parallelism in hybrid multi-core systems at the presented level of detail.
多核处理器的出现使得并行计算技术成为主流系统的必备技术。随着最近硬件加速器的兴起,混合并行性为软件开发过程增加了另一个维度的复杂性。本文介绍了一个硬件加速并行程序的图形化程序流分析工具。它监视混合程序的执行,以记录和可视化许多与性能相关的事件。为IBM的Cell处理器和NVIDIA的CUDA API编写的具有代表性的实际应用程序进行了举例研究。据我们所知,这种方法是第一个将混合多核系统中的并行性可视化的方法。
{"title":"High Resolution Program Flow Visualization of Hardware Accelerated Hybrid Multi-core Applications","authors":"D. Hackenberg, G. Juckeland, H. Brunst","doi":"10.1109/CCGRID.2010.27","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.27","url":null,"abstract":"The advent of multi-core processors has made parallel computing techniques mandatory on main stream systems. With the recent rise of hardware accelerators, hybrid parallelism adds yet another dimension of complexity to the process of software development. This article presents a tool for graphical program flow analysis of hardware accelerated parallel programs. It monitors the hybrid program execution to record and visualize many performance relevant events along the way. Representative real-world applications written for both IBM’s Cell processor and NVIDIA’s CUDA API are studied exemplarily. To the best of our knowledge, this approach is the first that visualizes the parallelism in hybrid multi-core systems at the presented level of detail.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"130 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116584283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Service Oriented Approach to High Performance Scientific Computing 面向服务的高性能科学计算方法
J. Mulerikkal, P. Strazdins
The Service Oriented Architecture (SOA) is embraced in distributed and grid computing to produce high performance results for long time. SOA is likened by application programmers for its trademark characteristics of programmability, efficiency in heterogeneous conditions and fault-tolerance. It has worked well for high performance financial applications. but not for scientific applications which are too fine grained and communication intensive to be efficient on distributed environments. This paper argues that to make SOA model work well for those scientific applications, we need to reduce overhead costs associated with smaller task loads arising from finer granularity and increased communications in those applications. This paper proposes a data service to be used along with the existing compute services in SOA middlewares to enable inter-communication of finer tasks with out loosing SOA properties of programmability and efficiency under heterogeneity. This data service shall better enable high performance scientific computing of medium to fine grained scientific applications.
面向服务的体系结构(SOA)包含在分布式和网格计算中,可以长时间产生高性能结果。应用程序程序员将SOA比作可编程性、异构条件下的效率和容错性等标志性特征。它在高性能金融应用程序中运行良好。但对于过于细粒度和通信密集的科学应用程序来说,这并不适用于分布式环境。本文认为,为了使SOA模型能够很好地适用于这些科学应用程序,我们需要减少与这些应用程序中更细的粒度和增加的通信带来的更小的任务负载相关的开销成本。本文提出了一种数据服务与现有的计算服务一起在SOA中间件中使用,以实现更精细的任务之间的通信,同时又不会失去异构环境下SOA的可编程性和效率特性。该数据服务将更好地支持中到细粒度科学应用程序的高性能科学计算。
{"title":"Service Oriented Approach to High Performance Scientific Computing","authors":"J. Mulerikkal, P. Strazdins","doi":"10.1109/CCGRID.2010.93","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.93","url":null,"abstract":"The Service Oriented Architecture (SOA) is embraced in distributed and grid computing to produce high performance results for long time. SOA is likened by application programmers for its trademark characteristics of programmability, efficiency in heterogeneous conditions and fault-tolerance. It has worked well for high performance financial applications. but not for scientific applications which are too fine grained and communication intensive to be efficient on distributed environments. This paper argues that to make SOA model work well for those scientific applications, we need to reduce overhead costs associated with smaller task loads arising from finer granularity and increased communications in those applications. This paper proposes a data service to be used along with the existing compute services in SOA middlewares to enable inter-communication of finer tasks with out loosing SOA properties of programmability and efficiency under heterogeneity. This data service shall better enable high performance scientific computing of medium to fine grained scientific applications.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116603379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Profit-Driven Service Request Scheduling in Clouds 利润驱动的云服务请求调度
Young Choon Lee, Chen Wang, Albert Y. Zomaya, B. Zhou
A primary driving force of the recent cloud computing paradigm is its inherent cost effectiveness. As in many basic utilities, such as electricity and water, consumers/clients in cloud computing environments are charged based on their service usage, hence the term ‘pay-per-use’. While this pricing model is very appealing for both service providers and consumers, fluctuating service request volume and conflicting objectives (e.g., profit vs. response time) between providers and consumers hinder its effective application to cloud computing environments. In this paper, we address the problem of service request scheduling in cloud computing systems. We consider a three-tier cloud structure, which consists of infrastructure vendors, service providers and consumers, the latter two parties are particular interest to us. Clearly, scheduling strategies in this scenario should satisfy the objectives of both parties. Our contributions include the development of a pricing model—using processor-sharing—for clouds, the application of this pricing model to composite services with dependency consideration (to the best of our knowledge, the work in this study is the first attempt), and the development of two sets of profit-driven scheduling algorithms.
最近云计算范式的主要驱动力是其固有的成本效益。与许多基础公用事业(如电和水)一样,云计算环境中的消费者/客户是根据他们的服务使用情况收费的,因此有了“按使用付费”这个术语。虽然这种定价模式对服务提供商和消费者都非常有吸引力,但提供商和消费者之间波动的服务请求量和冲突的目标(例如,利润与响应时间)阻碍了其在云计算环境中的有效应用。本文研究了云计算系统中的服务请求调度问题。我们考虑一个三层云结构,它由基础设施供应商、服务提供商和消费者组成,后两方对我们特别感兴趣。显然,这个场景中的调度策略应该满足双方的目标。我们的贡献包括开发了一种定价模型——使用处理器共享——用于云计算,将这种定价模型应用于考虑依赖性的组合服务(据我们所知,本研究中的工作是第一次尝试),以及开发了两套利润驱动的调度算法。
{"title":"Profit-Driven Service Request Scheduling in Clouds","authors":"Young Choon Lee, Chen Wang, Albert Y. Zomaya, B. Zhou","doi":"10.1109/CCGRID.2010.83","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.83","url":null,"abstract":"A primary driving force of the recent cloud computing paradigm is its inherent cost effectiveness. As in many basic utilities, such as electricity and water, consumers/clients in cloud computing environments are charged based on their service usage, hence the term ‘pay-per-use’. While this pricing model is very appealing for both service providers and consumers, fluctuating service request volume and conflicting objectives (e.g., profit vs. response time) between providers and consumers hinder its effective application to cloud computing environments. In this paper, we address the problem of service request scheduling in cloud computing systems. We consider a three-tier cloud structure, which consists of infrastructure vendors, service providers and consumers, the latter two parties are particular interest to us. Clearly, scheduling strategies in this scenario should satisfy the objectives of both parties. Our contributions include the development of a pricing model—using processor-sharing—for clouds, the application of this pricing model to composite services with dependency consideration (to the best of our knowledge, the work in this study is the first attempt), and the development of two sets of profit-driven scheduling algorithms.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126190026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 188
High Performance Data Transfer in Grid Environment Using GridFTP over InfiniBand 网格环境下基于GridFTP的高性能数据传输
H. Subramoni, P. Lai, R. Kettimuthu, D. Panda
GridFTP, designed using the Globus XIO framework, is one of the most popular methods in use to perform data transfers in the grid environment. But the performance of GridFTP in WAN is limited by the relatively low communication bandwidth offered by the existing network protocols. On the other hand, modern interconnects such as InfiniBand, with many advanced communication features like zero-copy protocol and RDMA operations, can greatly improve communication efficiency. In this paper, we take on the challenge of combining the ease of use of the Globus XIO framework and the high performance achieved through InfiniBand communication, thereby natively sup-porting GridFTP over InfiniBand based networks. The Advanced Data Transfer Service (ADTS), designed in our previous work, provides the low level InfiniBand support to the Globus XIO layer. We introduce the concepts of I/Ostaging in the Globus XIO ADTS driver to achieve efficient disk based data transfers. We evaluate our designs in both LAN and WAN environments using micro benchmarks as well as communication traces from several real world applications. We also provide insights into the communication performance with some in-depth analysis. Our experimental evaluation shows a performance improvement of up to100% for ADTS based data transfers as opposed to TCP or UDP based ones in LAN and high delay WAN scenarios.
GridFTP是使用Globus XIO框架设计的,是在网格环境中执行数据传输的最常用方法之一。但是GridFTP在广域网中的性能受到现有网络协议提供的相对较低的通信带宽的限制。另一方面,现代互连,如InfiniBand,具有许多先进的通信特性,如零复制协议和RDMA操作,可以大大提高通信效率。在本文中,我们接受了将Globus XIO框架的易用性与通过InfiniBand通信实现的高性能相结合的挑战,从而在基于InfiniBand的网络上本地支持GridFTP。在我们之前的工作中设计的高级数据传输服务(ADTS)为Globus XIO层提供了低级的InfiniBand支持。我们在Globus XIO ADTS驱动程序中引入I/Ostaging的概念,以实现高效的基于磁盘的数据传输。我们在局域网和广域网环境中使用微基准测试以及来自几个真实世界应用程序的通信跟踪来评估我们的设计。我们还通过一些深入的分析提供了对通信性能的见解。我们的实验评估显示,在LAN和高延迟WAN场景中,基于ADTS的数据传输与基于TCP或UDP的数据传输相比,性能提高高达100%。
{"title":"High Performance Data Transfer in Grid Environment Using GridFTP over InfiniBand","authors":"H. Subramoni, P. Lai, R. Kettimuthu, D. Panda","doi":"10.1109/CCGRID.2010.115","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.115","url":null,"abstract":"GridFTP, designed using the Globus XIO framework, is one of the most popular methods in use to perform data transfers in the grid environment. But the performance of GridFTP in WAN is limited by the relatively low communication bandwidth offered by the existing network protocols. On the other hand, modern interconnects such as InfiniBand, with many advanced communication features like zero-copy protocol and RDMA operations, can greatly improve communication efficiency. In this paper, we take on the challenge of combining the ease of use of the Globus XIO framework and the high performance achieved through InfiniBand communication, thereby natively sup-porting GridFTP over InfiniBand based networks. The Advanced Data Transfer Service (ADTS), designed in our previous work, provides the low level InfiniBand support to the Globus XIO layer. We introduce the concepts of I/Ostaging in the Globus XIO ADTS driver to achieve efficient disk based data transfers. We evaluate our designs in both LAN and WAN environments using micro benchmarks as well as communication traces from several real world applications. We also provide insights into the communication performance with some in-depth analysis. Our experimental evaluation shows a performance improvement of up to100% for ADTS based data transfers as opposed to TCP or UDP based ones in LAN and high delay WAN scenarios.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"103 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127132689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
SciCloud: Scientific Computing on the Cloud SciCloud:基于云的科学计算
S. Srirama, Oleg Batrashev, E. Vainikko
SciCloud is a project studying the scope of establishing private clouds at universities. With these clouds, researchers can efficiently use the already existing resources in solving computationally intensive scientific, mathematical, and academic problems. The project established a Eucalyptus based private cloud and developed several customized images that can be used in solving problems from mobile web services, distributed computing and bio-informatics domains. The poster demonstrates the SciCloud and reveals two applications that are benefiting from the setup along with our research scope and results in scientific computing.
SciCloud是一个研究在大学建立私有云范围的项目。有了这些云,研究人员可以有效地利用已经存在的资源来解决计算密集型的科学、数学和学术问题。该项目建立了一个基于Eucalyptus的私有云,并开发了几个可用于解决移动web服务、分布式计算和生物信息学领域问题的定制映像。海报展示了SciCloud,并揭示了两个受益于该设置的应用程序以及我们在科学计算方面的研究范围和结果。
{"title":"SciCloud: Scientific Computing on the Cloud","authors":"S. Srirama, Oleg Batrashev, E. Vainikko","doi":"10.1109/CCGRID.2010.56","DOIUrl":"https://doi.org/10.1109/CCGRID.2010.56","url":null,"abstract":"SciCloud is a project studying the scope of establishing private clouds at universities. With these clouds, researchers can efficiently use the already existing resources in solving computationally intensive scientific, mathematical, and academic problems. The project established a Eucalyptus based private cloud and developed several customized images that can be used in solving problems from mobile web services, distributed computing and bio-informatics domains. The poster demonstrates the SciCloud and reveals two applications that are benefiting from the setup along with our research scope and results in scientific computing.","PeriodicalId":444485,"journal":{"name":"2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2010-05-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129478608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 61
期刊
2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1