CloudCP '11最新文献

英文中文

Diagnosis of application server performance problems via thread level pattern analysis 通过线程级模式分析诊断应用服务器性能问题

CloudCP '11

Pub Date : 2011-04-10 DOI: 10.1145/1967422.1967424

John Liang, Xuwen Yu, A. Desai, Aiguo Dong, Rajit Kambo

Task level breakdown provides high level information for operation performance bottleneck analysis. Creating a task level breakdown requires prior knowledge of the application server program logic and can be very time consuming. In this paper, we present yShark, a profiling tool which generates a task level breakdown chart automatically without prior knowledge. With thread level pattern analysis, performance bottlenecks can be easily located. We illustrate the innovative features of this tool with two real-world application case studies. First, we show how performance bottlenecks from modules both within and external to the system can be easily identified by the tool; second, we present a pattern detection and pattern matching algorithm to detect duplicate sequential tasks that can be run in parallel. We also show how this tool can be useful in VMware vCloud Director production environments due to the minimal profiling overhead.

任务级别分解为操作性能瓶颈分析提供高级信息。创建任务级分解需要事先了解应用服务器程序逻辑，并且可能非常耗时。在本文中，我们提出了一个分析工具yShark，它可以在没有先验知识的情况下自动生成任务级分解图。通过线程级模式分析，可以很容易地找到性能瓶颈。我们通过两个实际应用案例研究来说明该工具的创新特性。首先，我们展示了如何通过工具轻松识别来自系统内部和外部模块的性能瓶颈;其次，我们提出了一种模式检测和模式匹配算法来检测可以并行运行的重复顺序任务。我们还展示了该工具如何在VMware vCloud Director生产环境中发挥作用，因为它的分析开销最小。

引用次数: 0

The KOALA cloud management service: a modern approach for cloud infrastructure management KOALA云管理服务:用于云基础设施管理的现代方法

CloudCP '11

Pub Date : 2011-04-10 DOI: 10.1145/1967422.1967423

C. Baun, M. Kunze

While the variety of public and private cloud infrastructure and storage service offerings increases, only few tools exist to efficiently manage hybrid cloud resources of different cloud providers. KOALA is a novel cloud management tool that allows to work with a large variety of services of various public and private cloud providers in a seamless and transparent way. While most management solutions like command-line tools or browser extensions require a local installation, KOALA follows the cloud paradigm and operates itself as a software service on the basis of a public or private cloud platform. In addition, KOALA executes alternatively standalone with a Linux or Mac OS X system. The KOALA cloud manager allows to control almost all existing cloud resources with an API compatible to the Amazon Web Services. The users of KOALA have the freedom of choice to use any browser and device to interact with the cloud services. Especially in cases where the users have strong requirements regarding security and privacy, it is a benefit to run the management solution for different cloud services in a private context.

虽然各种各样的公共和私有云基础设施和存储服务产品在增加，但只有很少的工具可以有效地管理不同云提供商的混合云资源。KOALA是一种新颖的云管理工具，允许以无缝和透明的方式与各种公共和私有云提供商的各种服务一起工作。虽然大多数管理解决方案(如命令行工具或浏览器扩展)都需要在本地安装，但KOALA遵循云范式，并在公共或私有云平台的基础上作为软件服务运行。此外，KOALA可以在Linux或Mac OS X系统上独立执行。KOALA云管理器允许使用与Amazon Web Services兼容的API来控制几乎所有现有的云资源。KOALA的用户可以自由选择使用任何浏览器和设备与云服务进行交互。特别是在用户对安全性和隐私有强烈需求的情况下，在私有上下文中为不同的云服务运行管理解决方案是有好处的。

引用次数: 9

COSCA: an easy-to-use component-based PaaS cloud system for common applications COSCA:用于通用应用程序的易于使用的基于组件的PaaS云系统

CloudCP '11

Pub Date : 2011-04-10 DOI: 10.1145/1967422.1967426

Steffen Kächele, Jörg Domaschka, F. Hauck

The emergence of cloud computing marks a significant change in the way computers are used in both enterprise and personal environments. Yet, as a young technology, cloud computing is far from being mature. Platform-as-a-service (PaaS) clouds promise to reduce maintenance and administration costs, but current frameworks lack crucial features for supporting a broad range of applications. Especially rigid constraints of the current PaaS programming models limit broader usage. Based on this observation we compiled eleven requirements of typical business applications such as programming model, placement, scalability, routing, isolation, load balancing, accounting, adaptability and modularity. We further observe that none of current platforms support a majority of these requested features. As a result, we present our own PaaS system, called COSCA that meets all of these requirements. COSCA's component-based design especially supports adaptability and modularity. We believe that our requirements and architecture may serve as a valuable guide for PaaS designers, implementers, and providers.

云计算的出现标志着计算机在企业和个人环境中的使用方式发生了重大变化。然而，作为一项年轻的技术，云计算还远未成熟。平台即服务(PaaS)云承诺降低维护和管理成本，但目前的框架缺乏支持广泛应用程序的关键特性。特别是当前PaaS编程模型的严格约束限制了更广泛的使用。基于这一观察，我们编制了典型业务应用程序的11个需求，如编程模型、布局、可伸缩性、路由、隔离、负载平衡、记帐、适应性和模块化。我们进一步观察到，目前没有一个平台支持这些要求的大部分功能。因此，我们提出了自己的PaaS系统，称为COSCA，它满足所有这些需求。COSCA基于组件的设计特别支持适应性和模块化。我们相信我们的需求和体系结构可以作为PaaS设计者、实现者和提供者的有价值的指南。

引用次数: 27

Assessing the overhead and scalability of system monitors for large data centers 评估大型数据中心系统监视器的开销和可伸缩性

CloudCP '11

Pub Date : 2011-04-10 DOI: 10.1145/1967422.1967425

M. Andreolini, M. Colajanni, R. Lancellotti

Current data centers are shifting towards cloud-based architectures as a means to obtain a scalable, cost-effective, robust service platform. In spite of this, the underlying management infrastructure has grown in terms of hardware resources and software complexity, making automated resource monitoring a necessity. There are several infrastructure monitoring tools designed to scale to a very high number of physical nodes. However, these tools either collect performance measure at a low frequency (missing the chance to capture the dynamics of a short-term management task) or are simply not equipped with instrumentation specific to cloud computing and virtualization. In this scenario, monitoring the correctness and efficiency of live migrations can become a nightmare. This situation will only worsen in the future, with the increased service demand due to spreading of the user base. In this paper, we assess the scalability of a prototype monitoring subsystem for different user scenarios. We also identify all the major bottlenecks and give insight on how to remove them.

当前的数据中心正在转向基于云的架构，以此作为获得可扩展、经济高效、健壮的服务平台的一种手段。尽管如此，底层管理基础设施在硬件资源和软件复杂性方面已经增长，使得自动化资源监控成为必要。有几种基础设施监控工具设计用于扩展到非常多的物理节点。然而，这些工具要么以较低的频率收集性能度量(错失捕捉短期管理任务动态的机会)，要么根本没有配备特定于云计算和虚拟化的工具。在这种情况下，监视实时迁移的正确性和效率可能会成为一场噩梦。随着用户群的扩大，服务需求的增加，这种情况在未来只会恶化。在本文中，我们评估了原型监控子系统在不同用户场景下的可扩展性。我们还确定了所有主要瓶颈，并提供了如何消除它们的见解。

引用次数: 4

Optimizing intermediate data management in MapReduce computations 优化MapReduce计算中的中间数据管理

CloudCP '11

Pub Date : 2011-04-10 DOI: 10.1145/1967422.1967427

Diana Moise, Thi-Thu-Lan Trieu, L. Bougé, Gabriel Antoniu

Many cloud computations process large datasets. Programming paradigms have been proposed to design this type of applications, so as to take advantage of the huge processing and storage options the cloud holds, but at the same time, to provide the user with a clean and easy to use interface. Among these programming models, we consider the MapReduce paradigm and its reference implementation, the Hadoop framework. We focus on the aspect of intermediate data, that is data produced and transferred between the two stages of the computation (map and reduce). The goal of this paper is to propose a storage mechanism for intermediate data with the purpose of optimizing the execution of MapReduce applications in the presence of failures, while keeping the impact on the job completion time to the minimum. To meet this goal, we rely on a fault-tolerant, concurrency-optimized data storage layer based on the BlobSeer data management service. We modify the Hadoop MapReduce framework to store the intermediate data in this layer (acting as a BlobSeer-based distributed file system) rather than using the local storage of the mappers, as in the vanilla version of Hadoop. To validate this work, we perform experiments on a large number of nodes of the Grid'5000 testbed. We demonstrate that our approach not only provides for intermediate data availability in case of failures, but also efficiently handles read/write accesses so that the overall job completion time is substantially improved.

许多云计算处理大型数据集。已经提出了编程范例来设计这种类型的应用程序，以便利用云所拥有的巨大处理和存储选项，但同时，为用户提供一个干净易用的界面。在这些编程模型中，我们考虑了MapReduce范式及其参考实现——Hadoop框架。我们关注中间数据方面，即在计算的两个阶段(map和reduce)之间产生和传输的数据。本文的目标是提出一种中间数据的存储机制，目的是在出现故障时优化MapReduce应用程序的执行，同时将对作业完成时间的影响降到最低。为了实现这一目标，我们依赖于基于BlobSeer数据管理服务的容错、并发优化的数据存储层。我们修改了Hadoop MapReduce框架，将中间数据存储在这一层(作为基于blobseer的分布式文件系统)，而不是像Hadoop的普通版本那样使用映射器的本地存储。为了验证这一工作，我们在Grid’5000测试台的大量节点上进行了实验。我们证明，我们的方法不仅在发生故障时提供了中间数据可用性，而且还有效地处理读/写访问，从而大大提高了总体作业完成时间。

{"title":"Optimizing intermediate data management in MapReduce computations","authors":"Diana Moise, Thi-Thu-Lan Trieu, L. Bougé, Gabriel Antoniu","doi":"10.1145/1967422.1967427","DOIUrl":"https://doi.org/10.1145/1967422.1967427","url":null,"abstract":"Many cloud computations process large datasets. Programming paradigms have been proposed to design this type of applications, so as to take advantage of the huge processing and storage options the cloud holds, but at the same time, to provide the user with a clean and easy to use interface. Among these programming models, we consider the MapReduce paradigm and its reference implementation, the Hadoop framework. We focus on the aspect of intermediate data, that is data produced and transferred between the two stages of the computation (map and reduce). The goal of this paper is to propose a storage mechanism for intermediate data with the purpose of optimizing the execution of MapReduce applications in the presence of failures, while keeping the impact on the job completion time to the minimum. To meet this goal, we rely on a fault-tolerant, concurrency-optimized data storage layer based on the BlobSeer data management service. We modify the Hadoop MapReduce framework to store the intermediate data in this layer (acting as a BlobSeer-based distributed file system) rather than using the local storage of the mappers, as in the vanilla version of Hadoop. To validate this work, we perform experiments on a large number of nodes of the Grid'5000 testbed. We demonstrate that our approach not only provides for intermediate data availability in case of failures, but also efficiently handles read/write accesses so that the overall job completion time is substantially improved.","PeriodicalId":365270,"journal":{"name":"CloudCP '11","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123656369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 26

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

CloudCP '11

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀