首页 > 最新文献

2015 IEEE International Conference on Cloud Engineering最新文献

英文 中文
Information Flow Control for Strong Protection with Flexible Sharing in PaaS PaaS中灵活共享的强保护信息流控制
Pub Date : 2015-03-09 DOI: 10.1109/IC2E.2015.64
Thomas Pasquier, Jatinder Singh, J. Bacon
The need to share data across applications is becoming increasingly evident. Current cloud isolation mechanisms focus solely on protection, such as containers that isolate at the OS-level, and virtual machines that isolate through the hypervisor. However, by focusing rigidly on protection, these approaches do not provide for controlled sharing. This paper presents how Information Flow Control (IFC) offers a flexible alternative. As a data-centric mechanism it enables strong isolation when required, while providing continuous, fine grained control of the data being shared. An IFC-enabled cloud platform would ensure that policies are enforced as data flows across all applications, without requiring any special sharing mechanisms.
跨应用程序共享数据的需求变得越来越明显。当前的云隔离机制仅关注于保护,例如在操作系统级别隔离的容器,以及通过管理程序隔离的虚拟机。然而,由于严格关注保护,这些方法不能提供可控的共享。本文介绍了信息流控制(IFC)如何提供一种灵活的替代方案。作为一种以数据为中心的机制,它在需要时支持强隔离,同时提供对共享数据的连续、细粒度控制。支持ifc的云平台将确保在所有应用程序中作为数据流执行策略,而不需要任何特殊的共享机制。
{"title":"Information Flow Control for Strong Protection with Flexible Sharing in PaaS","authors":"Thomas Pasquier, Jatinder Singh, J. Bacon","doi":"10.1109/IC2E.2015.64","DOIUrl":"https://doi.org/10.1109/IC2E.2015.64","url":null,"abstract":"The need to share data across applications is becoming increasingly evident. Current cloud isolation mechanisms focus solely on protection, such as containers that isolate at the OS-level, and virtual machines that isolate through the hypervisor. However, by focusing rigidly on protection, these approaches do not provide for controlled sharing. This paper presents how Information Flow Control (IFC) offers a flexible alternative. As a data-centric mechanism it enables strong isolation when required, while providing continuous, fine grained control of the data being shared. An IFC-enabled cloud platform would ensure that policies are enforced as data flows across all applications, without requiring any special sharing mechanisms.","PeriodicalId":395715,"journal":{"name":"2015 IEEE International Conference on Cloud Engineering","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121842725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
I/O Performance Modeling for Big Data Applications over Cloud Infrastructures 基于云基础设施的大数据应用I/O性能建模
Pub Date : 2015-03-09 DOI: 10.1109/IC2E.2015.29
Ioannis Mytilinis, Dimitrios Tsoumakos, Verena Kantere, Anastassios Nanos, N. Koziris
Big Data applications receive an ever-increasing amount of attention, thus becoming a dominant class of applications that are deployed over virtualized environments. Cloud environments entail a large amount of complexity relative to I/O performance. The use of Big Data increases the complexity of I/O management as well as its characterization and prediction: As I/O operations become growingly dominant in such applications, the intricacies of virtualization, different storage back ends and deployment setups significantly hinder our ability to analyze and correctly predict I/O performance. To that end, this work proposes an end-to-end modeling technique to predict performance of I/O--intensive Big Data applications running over cloud infrastructures. We develop a model tuned over application and infrastructure dimensions: Primitive I/O operations, data access patterns, storage back ends and deployment parameters. The trained model can be used to predict both I/O but also general task performance. Our evaluation results show that for jobs which are dominated by I/O operations, such as I/O-bound MapReduce jobs, our model is capable of predicting execution time with an accuracy close to 90% that decreases as application processing becomes more complex.
大数据应用受到越来越多的关注,因此成为部署在虚拟化环境上的主要应用类别。相对于I/O性能,云环境带来了大量的复杂性。大数据的使用增加了I/O管理及其特征和预测的复杂性:随着I/O操作在此类应用程序中越来越占主导地位,虚拟化、不同存储后端和部署设置的复杂性极大地阻碍了我们分析和正确预测I/O性能的能力。为此,本研究提出了一种端到端建模技术,用于预测在云基础设施上运行的I/O密集型大数据应用程序的性能。我们开发了一个针对应用程序和基础设施维度进行调整的模型:基本I/O操作、数据访问模式、存储后端和部署参数。经过训练的模型既可用于预测I/O,也可用于预测一般任务性能。我们的评估结果表明,对于I/O操作占主导地位的作业,例如I/O绑定的MapReduce作业,我们的模型能够以接近90%的准确率预测执行时间,随着应用程序处理变得更复杂而降低。
{"title":"I/O Performance Modeling for Big Data Applications over Cloud Infrastructures","authors":"Ioannis Mytilinis, Dimitrios Tsoumakos, Verena Kantere, Anastassios Nanos, N. Koziris","doi":"10.1109/IC2E.2015.29","DOIUrl":"https://doi.org/10.1109/IC2E.2015.29","url":null,"abstract":"Big Data applications receive an ever-increasing amount of attention, thus becoming a dominant class of applications that are deployed over virtualized environments. Cloud environments entail a large amount of complexity relative to I/O performance. The use of Big Data increases the complexity of I/O management as well as its characterization and prediction: As I/O operations become growingly dominant in such applications, the intricacies of virtualization, different storage back ends and deployment setups significantly hinder our ability to analyze and correctly predict I/O performance. To that end, this work proposes an end-to-end modeling technique to predict performance of I/O--intensive Big Data applications running over cloud infrastructures. We develop a model tuned over application and infrastructure dimensions: Primitive I/O operations, data access patterns, storage back ends and deployment parameters. The trained model can be used to predict both I/O but also general task performance. Our evaluation results show that for jobs which are dominated by I/O operations, such as I/O-bound MapReduce jobs, our model is capable of predicting execution time with an accuracy close to 90% that decreases as application processing becomes more complex.","PeriodicalId":395715,"journal":{"name":"2015 IEEE International Conference on Cloud Engineering","volume":"66 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132571565","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
FIDDLE: Federated Infrastructure Discovery and Description Language 联邦基础设施发现和描述语言
Pub Date : 2015-03-09 DOI: 10.1109/IC2E.2015.77
A. Willner, R. Loughnane, T. Magedanz
Considerable efforts have been spent on designing architectures to manage heterogeneous resources across multiple administrative domains. Specific fields of application are federated cloud computing (Intercloud) approaches and distributed testbeds, among others. An important interoperability challenge that arises in this context is the exchange of information about the provided resources and their dependencies. Existing work usually rests upon schematic data models, which impede the discovery and management of heterogeneous resources between autonomous sites. One way of addressing this issue is to exchange semantic information models. In this paper, we exploit such approaches to formally define federations, including their infrastructures and the life-cycle of the offered resources and services. The requirements of this work have been derived from several research projects and the results are in process of being standardized by an international body. The main contribution of this work is a higher level (upper) ontology and initial integration concepts for it. These contributions form a basis for further work in the general context of distributed semantic resource management.
在设计跨多个管理域管理异构资源的体系结构上已经花费了大量的精力。具体的应用领域包括联合云计算(Intercloud)方法和分布式测试平台等。在此上下文中出现的一个重要的互操作性挑战是关于所提供资源及其依赖关系的信息交换。现有的工作通常依赖于示意图数据模型,这阻碍了自治站点之间异构资源的发现和管理。解决这个问题的一种方法是交换语义信息模型。在本文中,我们利用这些方法来正式定义联邦,包括其基础设施和所提供资源和服务的生命周期。这项工作的要求来自若干研究项目,其结果正在由一个国际机构加以标准化。这项工作的主要贡献是一个更高层次的本体和它的初始集成概念。这些贡献为分布式语义资源管理的一般上下文中的进一步工作奠定了基础。
{"title":"FIDDLE: Federated Infrastructure Discovery and Description Language","authors":"A. Willner, R. Loughnane, T. Magedanz","doi":"10.1109/IC2E.2015.77","DOIUrl":"https://doi.org/10.1109/IC2E.2015.77","url":null,"abstract":"Considerable efforts have been spent on designing architectures to manage heterogeneous resources across multiple administrative domains. Specific fields of application are federated cloud computing (Intercloud) approaches and distributed testbeds, among others. An important interoperability challenge that arises in this context is the exchange of information about the provided resources and their dependencies. Existing work usually rests upon schematic data models, which impede the discovery and management of heterogeneous resources between autonomous sites. One way of addressing this issue is to exchange semantic information models. In this paper, we exploit such approaches to formally define federations, including their infrastructures and the life-cycle of the offered resources and services. The requirements of this work have been derived from several research projects and the results are in process of being standardized by an international body. The main contribution of this work is a higher level (upper) ontology and initial integration concepts for it. These contributions form a basis for further work in the general context of distributed semantic resource management.","PeriodicalId":395715,"journal":{"name":"2015 IEEE International Conference on Cloud Engineering","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127384642","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
EAGER: Deployment-Time API Governance for Modern PaaS Clouds EAGER:面向现代PaaS云的部署时API治理
Pub Date : 2015-03-09 DOI: 10.1109/IC2E.2015.69
Hiranya Jayathilaka, C. Krintz, R. Wolski
To track, control, and compel reuse of web APIs, we investigate a new approach to API governance -- combined policy, implementation, and deployment control of web APIs. Our approach, called EAGER, provides a software architecture that integrates into PaaS platforms to support systemwide, deployment-time enforcement of governance policies. Specifically, EAGER checks for and prevents backward incompatible API changes from being deployed into production PaaS clouds, enforces service reuse, and facilitates enforcement of other best practices in software maintenance via policies. Our experiments with an EAGER prototype show that enforcing API governance at deployment-time in PaaS clouds is efficient and scalable to thousands of APIs and policies.
为了跟踪、控制和强制web API的重用,我们研究了一种新的API治理方法——将web API的策略、实现和部署控制结合起来。我们的方法称为EAGER,它提供了一个集成到PaaS平台的软件架构,以支持系统范围的、部署时实施的治理策略。具体来说,EAGER检查并防止向后不兼容的API更改部署到生产PaaS云中,加强服务重用,并通过策略促进软件维护中的其他最佳实践的实施。我们对EAGER原型的实验表明,在PaaS云中部署时实施API治理是有效的,并且可扩展到数千个API和策略。
{"title":"EAGER: Deployment-Time API Governance for Modern PaaS Clouds","authors":"Hiranya Jayathilaka, C. Krintz, R. Wolski","doi":"10.1109/IC2E.2015.69","DOIUrl":"https://doi.org/10.1109/IC2E.2015.69","url":null,"abstract":"To track, control, and compel reuse of web APIs, we investigate a new approach to API governance -- combined policy, implementation, and deployment control of web APIs. Our approach, called EAGER, provides a software architecture that integrates into PaaS platforms to support systemwide, deployment-time enforcement of governance policies. Specifically, EAGER checks for and prevents backward incompatible API changes from being deployed into production PaaS clouds, enforces service reuse, and facilitates enforcement of other best practices in software maintenance via policies. Our experiments with an EAGER prototype show that enforcing API governance at deployment-time in PaaS clouds is efficient and scalable to thousands of APIs and policies.","PeriodicalId":395715,"journal":{"name":"2015 IEEE International Conference on Cloud Engineering","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116744250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
A Bird's-Eye View on Modelling Malleable Multi-cloud Applications 可塑多云应用建模的鸟瞰图
Pub Date : 2015-03-09 DOI: 10.1109/IC2E.2015.94
Mohammad Hamdaqa
Cloud platforms advances have changed the application development landscape. Cloud platforms abstract the complexity of application delivery to enable rapid development and easy management. This changes the way development teams need to think about and deal with the underlying resources while building and managing their applications. This research describes a new methodology supported by a modeling framework to enable organizations that build cloud applications (e.g., SaaS providers) to unbiasedly exploit the cloud platform building blocks to leverage the flexibility, reliability and scalability that these platforms provide to the application layer.
云平台的进步已经改变了应用程序开发的前景。云平台抽象了应用程序交付的复杂性,以实现快速开发和易于管理。这改变了开发团队在构建和管理应用程序时需要考虑和处理底层资源的方式。本研究描述了一种由建模框架支持的新方法,使构建云应用程序的组织(例如,SaaS提供商)能够公正地利用云平台构建块来利用这些平台为应用层提供的灵活性、可靠性和可扩展性。
{"title":"A Bird's-Eye View on Modelling Malleable Multi-cloud Applications","authors":"Mohammad Hamdaqa","doi":"10.1109/IC2E.2015.94","DOIUrl":"https://doi.org/10.1109/IC2E.2015.94","url":null,"abstract":"Cloud platforms advances have changed the application development landscape. Cloud platforms abstract the complexity of application delivery to enable rapid development and easy management. This changes the way development teams need to think about and deal with the underlying resources while building and managing their applications. This research describes a new methodology supported by a modeling framework to enable organizations that build cloud applications (e.g., SaaS providers) to unbiasedly exploit the cloud platform building blocks to leverage the flexibility, reliability and scalability that these platforms provide to the application layer.","PeriodicalId":395715,"journal":{"name":"2015 IEEE International Conference on Cloud Engineering","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131810355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Efficient Prototyping of Fault Tolerant Map-Reduce Applications with Docker-Hadoop 基于Docker-Hadoop的容错Map-Reduce应用的高效原型
Pub Date : 2015-03-09 DOI: 10.1109/IC2E.2015.73
J. Rey, M. Cogorno, Sergio Nesmachnow, L. Steffenel
Prototyping and testing distributed systems is considered to be a hard task because it is not always possible to reproduce a given sequence of events. While simulations may help on this task, they cannot replace test and validation with real systems. In this paper we present Docker-Hadoop, a container-based virtualization platform designed to prototype, test and deploy MapReduce applications and systems. This tool allowed us to test and reproduce fault-tolerance scenarios that are especially interesting in the context of the PER-MARE project, which aims at adapting the Hadoop framework to the case pervasive systems. Indeed, we developed a fault-tolerant component that can circumvent the limitations from original Hadoop and prevent the job scheduling stall in the case of failures or network disconnections. Thanks to Docker-Hadoop, we could easily prototype and test our improved Hadoop, with the first scalability and speedup results being presented in this paper.
对分布式系统进行原型设计和测试被认为是一项艰巨的任务,因为不可能总是重现给定的事件序列。虽然模拟可以帮助完成这项任务,但它们不能取代真实系统的测试和验证。在本文中,我们介绍了Docker-Hadoop,这是一个基于容器的虚拟化平台,旨在对MapReduce应用程序和系统进行原型化、测试和部署。这个工具允许我们测试和重现在PER-MARE项目上下文中特别有趣的容错场景,PER-MARE项目旨在使Hadoop框架适应case普适系统。实际上,我们开发了一个容错组件,它可以规避原始Hadoop的限制,并防止在故障或网络断开的情况下作业调度停滞。多亏了Docker-Hadoop,我们可以很容易地原型化和测试改进后的Hadoop,本文给出了第一个可扩展性和加速结果。
{"title":"Efficient Prototyping of Fault Tolerant Map-Reduce Applications with Docker-Hadoop","authors":"J. Rey, M. Cogorno, Sergio Nesmachnow, L. Steffenel","doi":"10.1109/IC2E.2015.73","DOIUrl":"https://doi.org/10.1109/IC2E.2015.73","url":null,"abstract":"Prototyping and testing distributed systems is considered to be a hard task because it is not always possible to reproduce a given sequence of events. While simulations may help on this task, they cannot replace test and validation with real systems. In this paper we present Docker-Hadoop, a container-based virtualization platform designed to prototype, test and deploy MapReduce applications and systems. This tool allowed us to test and reproduce fault-tolerance scenarios that are especially interesting in the context of the PER-MARE project, which aims at adapting the Hadoop framework to the case pervasive systems. Indeed, we developed a fault-tolerant component that can circumvent the limitations from original Hadoop and prevent the job scheduling stall in the case of failures or network disconnections. Thanks to Docker-Hadoop, we could easily prototype and test our improved Hadoop, with the first scalability and speedup results being presented in this paper.","PeriodicalId":395715,"journal":{"name":"2015 IEEE International Conference on Cloud Engineering","volume":"99 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131814174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Automating Cloud Service Level Agreements Using Semantic Technologies 使用语义技术自动化云服务水平协议
Pub Date : 2015-03-09 DOI: 10.1109/IC2E.2015.63
K. Joshi, C. Pearce
Cloud related legal documents, like terms of service or customer agreement are usually managed as plain text files. Hence extensive manual effort is required to monitor the cloud service performance by cross referencing the metrics and measures agreed upon in these documents. We have significantly automated the process of managing and monitoring cloud Service Level Agreements (SLA) using semantic web technologies like OWL, RDF and SPARQL. In this paper, we describe in detail the cloud SLA ontology and the prototype that we have developed to illustrate how the SLA measures can be automatically extracted from legal Terms of Service that are available on cloud provider websites.
与云相关的法律文件,如服务条款或客户协议,通常以纯文本文件的形式进行管理。因此,通过交叉引用这些文档中商定的指标和度量来监视云服务性能需要大量的手工工作。我们使用语义web技术,如OWL、RDF和SPARQL,显著地自动化了管理和监控云服务水平协议(SLA)的过程。在本文中,我们详细描述了云SLA本体和我们开发的原型,以说明如何从云提供商网站上可用的法律服务条款中自动提取SLA度量。
{"title":"Automating Cloud Service Level Agreements Using Semantic Technologies","authors":"K. Joshi, C. Pearce","doi":"10.1109/IC2E.2015.63","DOIUrl":"https://doi.org/10.1109/IC2E.2015.63","url":null,"abstract":"Cloud related legal documents, like terms of service or customer agreement are usually managed as plain text files. Hence extensive manual effort is required to monitor the cloud service performance by cross referencing the metrics and measures agreed upon in these documents. We have significantly automated the process of managing and monitoring cloud Service Level Agreements (SLA) using semantic web technologies like OWL, RDF and SPARQL. In this paper, we describe in detail the cloud SLA ontology and the prototype that we have developed to illustrate how the SLA measures can be automatically extracted from legal Terms of Service that are available on cloud provider websites.","PeriodicalId":395715,"journal":{"name":"2015 IEEE International Conference on Cloud Engineering","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114233094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Using Trustworthy Simulation to Engineer Cloud Schedulers 使用可信模拟来设计云调度器
Pub Date : 2015-03-09 DOI: 10.1109/IC2E.2015.14
A. Pucher, Emre Gul, R. Wolski, C. Krintz
In recent years, researchers have contributed promising new techniques for allocating cloud resources in more robust, efficient, and ecologically sustainable ways. Unfortunately, the wide-spread use of these techniques in production systems has, to date, remained elusive. One reason for this is that the state of the art for investigating these innovations at scale often relies solely on model-driven simulation. Production-grade cloud software, however, demands certainty and precision for development and business planning that only comes from validating simulation against empirical observation. In this work, we take an alternative approach to facilitating cloud research and engineering in order to transition innovations to production deployment faster. In particular, we present a new methodology that complements existing model-driven simulation with platform-specific and statistically trustworthy results. We simulate systems at scales and on time frames that are testable, and then, based on the statistical validation of these simulations, investigate scenarios beyond those feasibly observable in practice. We demonstrate the approach by developing an energy-aware cloud scheduler and evaluating it using production and synthetic traces in faster than real time. Our results show that we can accurately simulate a production IaaS system, ease capacity planning, and expedite the reliable development of its components and extensions.
近年来,研究人员贡献了有前途的新技术,以更稳健、更有效和生态可持续的方式分配云资源。不幸的是,到目前为止,这些技术在生产系统中的广泛应用仍然难以实现。其中一个原因是,大规模研究这些创新的最新技术通常只依赖于模型驱动的模拟。然而,生产级云软件要求开发和业务计划的确定性和精确性,这只能来自于对经验观察的验证模拟。在这项工作中,我们采用另一种方法来促进云研究和工程,以便更快地将创新转化为生产部署。特别是,我们提出了一种新的方法,用特定于平台和统计可信的结果补充现有的模型驱动仿真。我们在可测试的尺度和时间框架上模拟系统,然后,基于这些模拟的统计验证,研究超出实际可观察到的场景。我们通过开发能源感知云调度器并使用比实时更快的生产和合成轨迹对其进行评估来演示该方法。我们的结果表明,我们可以准确地模拟生产IaaS系统,简化容量规划,并加快其组件和扩展的可靠开发。
{"title":"Using Trustworthy Simulation to Engineer Cloud Schedulers","authors":"A. Pucher, Emre Gul, R. Wolski, C. Krintz","doi":"10.1109/IC2E.2015.14","DOIUrl":"https://doi.org/10.1109/IC2E.2015.14","url":null,"abstract":"In recent years, researchers have contributed promising new techniques for allocating cloud resources in more robust, efficient, and ecologically sustainable ways. Unfortunately, the wide-spread use of these techniques in production systems has, to date, remained elusive. One reason for this is that the state of the art for investigating these innovations at scale often relies solely on model-driven simulation. Production-grade cloud software, however, demands certainty and precision for development and business planning that only comes from validating simulation against empirical observation. In this work, we take an alternative approach to facilitating cloud research and engineering in order to transition innovations to production deployment faster. In particular, we present a new methodology that complements existing model-driven simulation with platform-specific and statistically trustworthy results. We simulate systems at scales and on time frames that are testable, and then, based on the statistical validation of these simulations, investigate scenarios beyond those feasibly observable in practice. We demonstrate the approach by developing an energy-aware cloud scheduler and evaluating it using production and synthetic traces in faster than real time. Our results show that we can accurately simulate a production IaaS system, ease capacity planning, and expedite the reliable development of its components and extensions.","PeriodicalId":395715,"journal":{"name":"2015 IEEE International Conference on Cloud Engineering","volume":"120 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116373798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Harp: Collective Communication on Hadoop Harp: Hadoop上的集体通信
Pub Date : 2015-03-09 DOI: 10.1109/IC2E.2015.35
Bingjing Zhang, Yang Ruan, J. Qiu
Big data processing tools have evolved rapidly in recent years. MapReduce has proven very successful but is not optimized for many important analytics, especially those involving iteration. In this regard, Iterative MapReduce frameworks improve performance of MapReduce job chains through caching. Further, Pregel, Giraph and Graph Lab abstract data as a graph and process it in iterations. But all these tools are designed with a fixed data abstraction and have limited collective communication support to synchronize application data and algorithm control states among parallel processes. In this paper, we introduce a collective communication abstraction layer which provides efficient collective communication operations on several common data abstractions such as arrays, key-values and graphs, and define a Map Collective programming model which serves the diverse collective communication demands in different parallel algorithms. We implement a library called Harp to provide the features above and plug it into Hadoop so that applications abstracted in Map Collective model can be easily developed on top of MapReduce framework and conveniently integrated with other tools in Apache Big Data Stack. With improved expressiveness in the abstraction and excellent performance on the implementation, we can simultaneously support various applications from HPC to Cloud systems together with high performance.
近年来,大数据处理工具发展迅速。MapReduce已经被证明是非常成功的,但是对于许多重要的分析,特别是那些涉及迭代的分析,并没有进行优化。在这方面,迭代MapReduce框架通过缓存提高了MapReduce作业链的性能。此外,Pregel, Giraph和Graph Lab将数据抽象为图形并在迭代中进行处理。但是所有这些工具都是用固定的数据抽象设计的,并且在并行进程之间同步应用程序数据和算法控制状态的集体通信支持有限。在本文中,我们引入了一个集体通信抽象层,为数组、键值和图等几种常见的数据抽象提供高效的集体通信操作,并定义了一个Map collective编程模型,以满足不同并行算法中不同的集体通信需求。我们实现了一个名为Harp的库来提供上述功能,并将其插入Hadoop中,以便Map Collective模型中抽象的应用程序可以轻松地在MapReduce框架上开发,并方便地与Apache大数据堆栈中的其他工具集成。通过改进的抽象表达能力和出色的实现性能,我们可以同时支持从高性能计算到云系统的各种应用。
{"title":"Harp: Collective Communication on Hadoop","authors":"Bingjing Zhang, Yang Ruan, J. Qiu","doi":"10.1109/IC2E.2015.35","DOIUrl":"https://doi.org/10.1109/IC2E.2015.35","url":null,"abstract":"Big data processing tools have evolved rapidly in recent years. MapReduce has proven very successful but is not optimized for many important analytics, especially those involving iteration. In this regard, Iterative MapReduce frameworks improve performance of MapReduce job chains through caching. Further, Pregel, Giraph and Graph Lab abstract data as a graph and process it in iterations. But all these tools are designed with a fixed data abstraction and have limited collective communication support to synchronize application data and algorithm control states among parallel processes. In this paper, we introduce a collective communication abstraction layer which provides efficient collective communication operations on several common data abstractions such as arrays, key-values and graphs, and define a Map Collective programming model which serves the diverse collective communication demands in different parallel algorithms. We implement a library called Harp to provide the features above and plug it into Hadoop so that applications abstracted in Map Collective model can be easily developed on top of MapReduce framework and conveniently integrated with other tools in Apache Big Data Stack. With improved expressiveness in the abstraction and excellent performance on the implementation, we can simultaneously support various applications from HPC to Cloud systems together with high performance.","PeriodicalId":395715,"journal":{"name":"2015 IEEE International Conference on Cloud Engineering","volume":"206 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115739348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
Comparing Containers versus Virtual Machines for Achieving High Availability 比较容器和虚拟机实现高可用性
Pub Date : 2015-03-09 DOI: 10.1109/IC2E.2015.79
Wubin Li, A. Kanso
In recent decades, virtualization as an abstraction from physical hardware has become a popular solution to resource isolation and server consolidation. With the surge in adoption of virtualization technologies, ensuring High Availability (HA) for applications hosted in virtualized environments emerges as an important problem and has garnered substantial attention. In this paper, we present a brief comparison of virtualization technologies from a HA perspective. The state-of-the-art HA solutions in two mainstream types of virtualized platforms (i.e., hypervisor-based platform and container-based platform) are respectively investigated in terms of limitations and features such as live migration, failure detection, and checkpoint/ restore. One of our key findings is that, compared with hypervisor-based platforms, HA features in container-based platforms are far from enough. From a HA perspective, extensions on top of container technologies are required.
近几十年来,作为物理硬件抽象的虚拟化已经成为资源隔离和服务器整合的流行解决方案。随着虚拟化技术的大量采用,确保托管在虚拟化环境中的应用程序的高可用性(HA)已成为一个重要问题,并引起了广泛关注。在本文中,我们从HA的角度对虚拟化技术进行了简要的比较。本文分别研究了两种主流虚拟化平台(即基于管理程序的平台和基于容器的平台)中最先进的HA解决方案的局限性和特性,如实时迁移、故障检测和检查点/恢复。我们的一个重要发现是,与基于管理程序的平台相比,基于容器的平台中的HA功能远远不够。从HA的角度来看,需要在容器技术之上进行扩展。
{"title":"Comparing Containers versus Virtual Machines for Achieving High Availability","authors":"Wubin Li, A. Kanso","doi":"10.1109/IC2E.2015.79","DOIUrl":"https://doi.org/10.1109/IC2E.2015.79","url":null,"abstract":"In recent decades, virtualization as an abstraction from physical hardware has become a popular solution to resource isolation and server consolidation. With the surge in adoption of virtualization technologies, ensuring High Availability (HA) for applications hosted in virtualized environments emerges as an important problem and has garnered substantial attention. In this paper, we present a brief comparison of virtualization technologies from a HA perspective. The state-of-the-art HA solutions in two mainstream types of virtualized platforms (i.e., hypervisor-based platform and container-based platform) are respectively investigated in terms of limitations and features such as live migration, failure detection, and checkpoint/ restore. One of our key findings is that, compared with hypervisor-based platforms, HA features in container-based platforms are far from enough. From a HA perspective, extensions on top of container technologies are required.","PeriodicalId":395715,"journal":{"name":"2015 IEEE International Conference on Cloud Engineering","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132678488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 53
期刊
2015 IEEE International Conference on Cloud Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1