首页 > 最新文献

2014 IEEE 7th International Conference on Cloud Computing最新文献

英文 中文
Encrypted Scalar Product Protocol for Outsourced Data Mining 外包数据挖掘的加密标量乘积协议
Pub Date : 2014-06-27 DOI: 10.1109/CLOUD.2014.53
Fang Liu, W. Ng, Wei Zhang
Organizations and individuals nowadays face increasing daily operations closely rely on a huge amount of private data which is outsourced to a centralized server. Secure and efficient data processing and mining on such outsourced private data becomes a primary concern for users, especially with the push of cloud computing which has both resource and compute scalability. Among the building blocks of secure data mining algorithms, secure scalar product is used to calculate the sum of the products of the corresponding values of two vectors. Existing privacy preserving methods assume data is stored at the user side, and users follow a protocol to perform privacy preserving scalar product. However, such methods are not applicable as data now is outsourced to a centralized server in its encrypted form. To solve this problem, in this paper, we design a novel Protocol for Outsourced Scalar Product (POSP) that performs collaborative operations between server and users to produce the scalar product result without violating each user's data privacy. We proved that POSP can return the correct result and is secure. We also analysed that POSP has linear complexity in terms of space, computation, and communication with respect to the vector length.
如今,组织和个人面临越来越多的日常操作,这些操作密切依赖于外包给中央服务器的大量私人数据。安全高效地处理和挖掘此类外包私有数据成为用户关注的首要问题,特别是随着云计算的发展,云计算具有资源和计算可扩展性。在安全数据挖掘算法的构建块中,安全标量积用于计算两个向量对应值的乘积的和。现有的隐私保护方法假设数据存储在用户端,用户按照协议执行隐私保护标量积。然而,这些方法并不适用,因为现在数据以加密形式外包给中央服务器。为了解决这一问题,本文设计了一种新的外包标量积协议(POSP),该协议在服务器和用户之间进行协作操作,以产生标量积结果,而不侵犯每个用户的数据隐私。证明了POSP可以返回正确的结果,并且是安全的。我们还分析了POSP在空间、计算和通信方面相对于向量长度具有线性复杂性。
{"title":"Encrypted Scalar Product Protocol for Outsourced Data Mining","authors":"Fang Liu, W. Ng, Wei Zhang","doi":"10.1109/CLOUD.2014.53","DOIUrl":"https://doi.org/10.1109/CLOUD.2014.53","url":null,"abstract":"Organizations and individuals nowadays face increasing daily operations closely rely on a huge amount of private data which is outsourced to a centralized server. Secure and efficient data processing and mining on such outsourced private data becomes a primary concern for users, especially with the push of cloud computing which has both resource and compute scalability. Among the building blocks of secure data mining algorithms, secure scalar product is used to calculate the sum of the products of the corresponding values of two vectors. Existing privacy preserving methods assume data is stored at the user side, and users follow a protocol to perform privacy preserving scalar product. However, such methods are not applicable as data now is outsourced to a centralized server in its encrypted form. To solve this problem, in this paper, we design a novel Protocol for Outsourced Scalar Product (POSP) that performs collaborative operations between server and users to produce the scalar product result without violating each user's data privacy. We proved that POSP can return the correct result and is secure. We also analysed that POSP has linear complexity in terms of space, computation, and communication with respect to the vector length.","PeriodicalId":288542,"journal":{"name":"2014 IEEE 7th International Conference on Cloud Computing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127885162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Multicast Virtual Network Embedding in Cloud Data Centers with Delay Constraints 时延约束下云数据中心组播虚拟网络嵌入
Pub Date : 2014-06-27 DOI: 10.1109/CLOUD.2014.71
Sara Ayoubi, K. Shaban, C. Assi
Network virtualization enables the multi-tenancy concept and paves the way towards more advancements and innovation in the underlying infrastructure. With network virtualization, allocating resources to Virtual Networks (VNs) that represent tenants' requests emerges as a challenging problem. This problem is commonly known as the Virtual Network Embedding (VNE) problem, and its NP-Hard nature has drawn a lot of attention from the research community. A common feature in the existing work is that the type of communication in the VN requests was never characterized, assuming that they exhibit unicast communication only. In this paper, we motivate the importance of characterizing the type of communication in VN requests. We present a formal definition of the VNE problem for VNs with multicast communication. To the best of our knowledge, the multicast VNE problem has not been addressed in the frame of cloud computing, where the location of all the virtual machines in a given multicast VN is unknown. We propose a novel 3-steps heuristic to solve the multicast VNE problem with end-delay and delay variation constraints. Our numerical results prove the efficiency of our suggested approach over multiple metrics and against numerous embedding heuristics.
网络虚拟化支持多租户概念,并为底层基础设施中的更多进步和创新铺平了道路。在网络虚拟化中,将资源分配给代表租户请求的虚拟网络(Virtual network, VNs)是一个具有挑战性的问题。这个问题通常被称为虚拟网络嵌入(VNE)问题,它的NP-Hard性质引起了学术界的广泛关注。现有工作中的一个共同特点是,从未对VN请求中的通信类型进行描述,假设它们只显示单播通信。在本文中,我们提出了表征VN请求中通信类型的重要性。给出了具有组播通信的VNs的VNE问题的形式化定义。据我们所知,在云计算的框架中,组播VNE问题还没有得到解决,在云计算中,给定组播VN中所有虚拟机的位置是未知的。我们提出了一种新的三步启发式算法来解决具有端延迟和延迟变化约束的组播VNE问题。我们的数值结果证明了我们所提出的方法在多个度量和多种嵌入启发式下的有效性。
{"title":"Multicast Virtual Network Embedding in Cloud Data Centers with Delay Constraints","authors":"Sara Ayoubi, K. Shaban, C. Assi","doi":"10.1109/CLOUD.2014.71","DOIUrl":"https://doi.org/10.1109/CLOUD.2014.71","url":null,"abstract":"Network virtualization enables the multi-tenancy concept and paves the way towards more advancements and innovation in the underlying infrastructure. With network virtualization, allocating resources to Virtual Networks (VNs) that represent tenants' requests emerges as a challenging problem. This problem is commonly known as the Virtual Network Embedding (VNE) problem, and its NP-Hard nature has drawn a lot of attention from the research community. A common feature in the existing work is that the type of communication in the VN requests was never characterized, assuming that they exhibit unicast communication only. In this paper, we motivate the importance of characterizing the type of communication in VN requests. We present a formal definition of the VNE problem for VNs with multicast communication. To the best of our knowledge, the multicast VNE problem has not been addressed in the frame of cloud computing, where the location of all the virtual machines in a given multicast VN is unknown. We propose a novel 3-steps heuristic to solve the multicast VNE problem with end-delay and delay variation constraints. Our numerical results prove the efficiency of our suggested approach over multiple metrics and against numerous embedding heuristics.","PeriodicalId":288542,"journal":{"name":"2014 IEEE 7th International Conference on Cloud Computing","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126563924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Improving Users' Isolation in IaaS: Virtual Machine Placement with Security Constraints 改进IaaS中的用户隔离:具有安全约束的虚拟机放置
Pub Date : 2014-06-27 DOI: 10.1109/CLOUD.2014.19
E. Caron, Jonathan Rouzaud-Cornabas
Nowadays, virtualization is used as the sole mechanism to isolate different users on Cloud platforms. In this paper, we show that, due to improper virtualization of micro-architectural components, data leak and modification can occur on public Clouds. Furthermore, using the same vector, it is possible to induce performance interferences, i.e. noisy neighbors. Using this approach, a VM can steal resources from, and slow down, concurrent VMs. To counter this, we propose placement heuristics that take into account isolation requirements, thus allowing a user to specify the level of isolation he accepts, and with whom. We modify 3 classical heuristics to take into account these requirements. In addition, we propose 4 new heuristics that take into account the hierarchy of Cloud platforms and isolation requirements. Finally, we evaluate these heuristics and compare them with the modified classical ones. We show that our heuristics perform at least as well as the classical ones, while scaling better and being faster by a few orders of magnitude.
如今,虚拟化被用作隔离云平台上不同用户的唯一机制。在本文中,我们表明,由于微架构组件虚拟化不当,可能会在公共云上发生数据泄漏和修改。此外,使用相同的矢量,有可能诱导性能干扰,即噪声邻居。通过这种方式,虚拟机可以窃取并发虚拟机的资源,降低并发虚拟机的运行速度。为了解决这个问题,我们提出了考虑隔离要求的放置启发式方法,从而允许用户指定他接受的隔离级别以及与谁进行隔离。我们修改了3个经典的启发式来考虑这些要求。此外,考虑到云平台的层次结构和隔离要求,我们提出了4种新的启发式方法。最后,我们对这些启发式方法进行了评价,并与改进后的经典启发式方法进行了比较。我们表明,我们的启发式算法至少和经典算法一样好,同时扩展得更好,速度也快了几个数量级。
{"title":"Improving Users' Isolation in IaaS: Virtual Machine Placement with Security Constraints","authors":"E. Caron, Jonathan Rouzaud-Cornabas","doi":"10.1109/CLOUD.2014.19","DOIUrl":"https://doi.org/10.1109/CLOUD.2014.19","url":null,"abstract":"Nowadays, virtualization is used as the sole mechanism to isolate different users on Cloud platforms. In this paper, we show that, due to improper virtualization of micro-architectural components, data leak and modification can occur on public Clouds. Furthermore, using the same vector, it is possible to induce performance interferences, i.e. noisy neighbors. Using this approach, a VM can steal resources from, and slow down, concurrent VMs. To counter this, we propose placement heuristics that take into account isolation requirements, thus allowing a user to specify the level of isolation he accepts, and with whom. We modify 3 classical heuristics to take into account these requirements. In addition, we propose 4 new heuristics that take into account the hierarchy of Cloud platforms and isolation requirements. Finally, we evaluate these heuristics and compare them with the modified classical ones. We show that our heuristics perform at least as well as the classical ones, while scaling better and being faster by a few orders of magnitude.","PeriodicalId":288542,"journal":{"name":"2014 IEEE 7th International Conference on Cloud Computing","volume":"372 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115905146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Reliability and Utilization Evaluation of a Cloud Computing System Allowing Partial Failures 允许部分故障的云计算系统可靠性和利用率评估
Pub Date : 2014-06-27 DOI: 10.1109/CLOUD.2014.131
Congyingzi Zhang, Robert C. Green, Mansoor Alam
Maintaining high reliability and device utilization in a cloud computing system (CCS) is crucial to any cloud service provider who will face high penalties and lose revenues if they fail to be good at both. This study proposes that allowing device partial failure in a CCS for graceful service degrading would help to obtain higher system reliability and device utilization without purchasing extra resource for the system. A model is created to represent such a multi-state system composed of multi-state devices. The system model is evaluated with Non-sequential Monte Carlo Simulation (MCS) on its reliability and device utilization. The preliminary results positively suggest that introducing and adding device multi-state increases the CCS reliability against device failures during simulation. Also, for the less reliable devices, like HDD, the results recommended a higher multi-state to compensate for their vulnerability and negative effect on system performance. A utilization index along all device dimensions is proposed in this research for a wise decision about maintaining a well-balanced and high utilized system at a lower cost.
在云计算系统(CCS)中保持高可靠性和设备利用率对任何云服务提供商来说都是至关重要的,如果他们不能做好这两件事,他们将面临高额罚款和收入损失。本研究提出在CCS中允许设备部分故障以实现优雅的服务降级,将有助于在不为系统购买额外资源的情况下获得更高的系统可靠性和设备利用率。建立了一个模型来表示由多状态设备组成的多状态系统。用非顺序蒙特卡罗仿真(MCS)对系统模型的可靠性和器件利用率进行了评估。初步结果表明,引入和增加设备多状态可以提高CCS在设备故障时的可靠性。此外,对于不太可靠的设备(如HDD),结果建议使用更高的多状态,以补偿它们的脆弱性和对系统性能的负面影响。在本研究中,提出了一个沿所有设备尺寸的利用率指数,以明智地决定如何以较低的成本维持一个平衡和高利用率的系统。
{"title":"Reliability and Utilization Evaluation of a Cloud Computing System Allowing Partial Failures","authors":"Congyingzi Zhang, Robert C. Green, Mansoor Alam","doi":"10.1109/CLOUD.2014.131","DOIUrl":"https://doi.org/10.1109/CLOUD.2014.131","url":null,"abstract":"Maintaining high reliability and device utilization in a cloud computing system (CCS) is crucial to any cloud service provider who will face high penalties and lose revenues if they fail to be good at both. This study proposes that allowing device partial failure in a CCS for graceful service degrading would help to obtain higher system reliability and device utilization without purchasing extra resource for the system. A model is created to represent such a multi-state system composed of multi-state devices. The system model is evaluated with Non-sequential Monte Carlo Simulation (MCS) on its reliability and device utilization. The preliminary results positively suggest that introducing and adding device multi-state increases the CCS reliability against device failures during simulation. Also, for the less reliable devices, like HDD, the results recommended a higher multi-state to compensate for their vulnerability and negative effect on system performance. A utilization index along all device dimensions is proposed in this research for a wise decision about maintaining a well-balanced and high utilized system at a lower cost.","PeriodicalId":288542,"journal":{"name":"2014 IEEE 7th International Conference on Cloud Computing","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131549612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
RESeED: Regular Expression Search over Encrypted Data in the Cloud RESeED:对云中的加密数据进行正则表达式搜索
Pub Date : 2014-06-27 DOI: 10.1109/CLOUD.2014.95
M. Salehi, T. Caldwell, Alejandro Fernandez, Emmanuel Mickiewicz, Eric Rozier, S. Zonouz, David Redberg
Capabilities for trustworthy cloud-based computing and data storage require usable, secure and efficient solutions which allow clients to remotely store and process their data in the cloud. In this paper, we present RESeED, a tool which provides user-transparent and cloud-agnostic search over encrypted data using regular expressions without requiring cloud providers to make changes to their existing infrastructure. When a client asks RESeED to upload a new file in the cloud, RESeED analyzes the file's content and updates novel data structures accordingly, encrypting and transferring the new data to the cloud. RESeED provides regular expression search over this encrypted data by translating queries on-the-fly to finite automata and analyzes efficient and secure representations of the data before asking the cloud to download the encrypted files. We evaulate a working prototype of RESeED experimentally (currently publicly available) and show the scalability and correctness of our approach using real-world data sets from arXiv.org and the IETF. We show absolute accuracy for RESeED, with very low (6%) overhead, and high performability, even beating grep for some benchmarks.
值得信赖的基于云的计算和数据存储能力需要可用、安全和高效的解决方案,这些解决方案允许客户在云中远程存储和处理数据。在本文中,我们介绍了RESeED,这是一个使用正则表达式对加密数据提供用户透明且与云无关的搜索的工具,而不需要云提供商对其现有基础设施进行更改。当客户端要求RESeED在云中上传新文件时,RESeED分析文件的内容并相应地更新新的数据结构,加密并将新数据传输到云中。RESeED通过动态地将查询转换为有限自动机,在加密数据上提供正则表达式搜索,并在要求云下载加密文件之前分析数据的高效和安全表示。我们通过实验评估了resseed的工作原型(目前公开可用),并使用来自arXiv.org和IETF的真实数据集展示了我们方法的可扩展性和正确性。我们展示了RESeED的绝对准确性,开销非常低(6%),性能很高,在一些基准测试中甚至超过了grep。
{"title":"RESeED: Regular Expression Search over Encrypted Data in the Cloud","authors":"M. Salehi, T. Caldwell, Alejandro Fernandez, Emmanuel Mickiewicz, Eric Rozier, S. Zonouz, David Redberg","doi":"10.1109/CLOUD.2014.95","DOIUrl":"https://doi.org/10.1109/CLOUD.2014.95","url":null,"abstract":"Capabilities for trustworthy cloud-based computing and data storage require usable, secure and efficient solutions which allow clients to remotely store and process their data in the cloud. In this paper, we present RESeED, a tool which provides user-transparent and cloud-agnostic search over encrypted data using regular expressions without requiring cloud providers to make changes to their existing infrastructure. When a client asks RESeED to upload a new file in the cloud, RESeED analyzes the file's content and updates novel data structures accordingly, encrypting and transferring the new data to the cloud. RESeED provides regular expression search over this encrypted data by translating queries on-the-fly to finite automata and analyzes efficient and secure representations of the data before asking the cloud to download the encrypted files. We evaulate a working prototype of RESeED experimentally (currently publicly available) and show the scalability and correctness of our approach using real-world data sets from arXiv.org and the IETF. We show absolute accuracy for RESeED, with very low (6%) overhead, and high performability, even beating grep for some benchmarks.","PeriodicalId":288542,"journal":{"name":"2014 IEEE 7th International Conference on Cloud Computing","volume":"77 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131259386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 22
Lego4TOSCA: Composable Building Blocks for Cloud Applications Lego4TOSCA:云应用的可组合构建块
Pub Date : 2014-06-27 DOI: 10.1109/CLOUD.2014.31
Florian Haupt, F. Leymann, Alexander Nowak, S. Wagner
The Topology and Orchestration Specification for Cloud Applications (TOSCA) enables the description, provisioning, and management of complex cloud applications in a portable way. TOSCA, therefore, provides a comprehensive yet complex set of mechanisms that may hinder users from unleashing its power due to misusing or neglecting parts of those mechanisms. TOSCA has just been standardized and, although it seems to be highly adopted in industry, there is a lack of systematic research of its features and capabilities. In this work we discuss the design of basic building blocks for cloud applications, called node types, and show how they can benefit from a deep integration with TOSCA. We developed a generic architecture for the realization of TOSCA node types, show an implementation of this architecture and validate it based on a sample cloud application. Our work gives an insight into the capabilities of TOSCA with respect to enable the creation of portable cloud services based on a set of composable building blocks.
云应用的拓扑和编排规范(TOSCA)支持以一种可移植的方式对复杂的云应用进行描述、配置和管理。因此,TOSCA提供了一组全面而复杂的机制,由于滥用或忽视这些机制的某些部分,这些机制可能会阻碍用户释放其功能。TOSCA刚刚标准化,虽然它在工业中似乎被高度采用,但缺乏对其特性和功能的系统研究。在本文中,我们将讨论云应用程序的基本构建块(称为节点类型)的设计,并展示它们如何从与TOSCA的深度集成中获益。我们为TOSCA节点类型的实现开发了一个通用体系结构,展示了该体系结构的实现,并基于示例云应用程序对其进行了验证。我们的工作深入了解了TOSCA在支持基于一组可组合构建块创建可移植云服务方面的功能。
{"title":"Lego4TOSCA: Composable Building Blocks for Cloud Applications","authors":"Florian Haupt, F. Leymann, Alexander Nowak, S. Wagner","doi":"10.1109/CLOUD.2014.31","DOIUrl":"https://doi.org/10.1109/CLOUD.2014.31","url":null,"abstract":"The Topology and Orchestration Specification for Cloud Applications (TOSCA) enables the description, provisioning, and management of complex cloud applications in a portable way. TOSCA, therefore, provides a comprehensive yet complex set of mechanisms that may hinder users from unleashing its power due to misusing or neglecting parts of those mechanisms. TOSCA has just been standardized and, although it seems to be highly adopted in industry, there is a lack of systematic research of its features and capabilities. In this work we discuss the design of basic building blocks for cloud applications, called node types, and show how they can benefit from a deep integration with TOSCA. We developed a generic architecture for the realization of TOSCA node types, show an implementation of this architecture and validate it based on a sample cloud application. Our work gives an insight into the capabilities of TOSCA with respect to enable the creation of portable cloud services based on a set of composable building blocks.","PeriodicalId":288542,"journal":{"name":"2014 IEEE 7th International Conference on Cloud Computing","volume":"34 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128891595","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Taming Computation Skews of Block-Oriented Iterative Scientific Applications in MapReduce Systems MapReduce系统中面向块的迭代科学应用的驯服计算偏差
Pub Date : 2014-06-27 DOI: 10.1109/CLOUD.2014.33
Xin Yang, Min Li, Ze Yu, Xiaolin Li
Nowadays, scientists are embracing big data techniques for exploring significant discoveries from large volumes of scientific data quickly. Properly partitioning workloads is essential for fully exploiting the benefit of parallelism, but is difficult for applications whose computations change iteratively. Computation skews are inevitable when executing block-oriented iterative scientific applications in MapReduce systems. This paper proposes iPart, an autonomic workload partitioning system for taming computation skews of block-oriented iterative scientific applications in MapReduce systems. iPart introduces a workload control loop into the conventional execution of MapReduce jobs. Workload estimates in terms of execution time are collected in the reduce phase and fed back to the partition phase to update partitioning plans. Computation skews are detected and addressed by adapting partitioning to computation changes iteratively. Two adaptive partitioning methods based on the binary partitioning method are presented. Experimental evaluations with two simulated applications and the synthetic and real-world data prove that iPart responds to computation changes and adapts partitioning quickly and accurately.
如今,科学家们正在采用大数据技术,从大量的科学数据中快速探索重大发现。正确划分工作负载对于充分利用并行性的好处至关重要,但是对于计算迭代变化的应用程序来说很难。在MapReduce系统中执行面向块的迭代科学应用程序时,计算偏差是不可避免的。针对MapReduce系统中面向块的迭代科学应用的计算倾斜,提出了一种自主工作负载划分系统iPart。iPart在MapReduce作业的常规执行中引入了一个工作负载控制循环。在reduce阶段收集执行时间方面的工作负载估计,并将其反馈到分区阶段以更新分区计划。通过迭代地调整分区以适应计算变化来检测和解决计算倾斜。在二元划分方法的基础上,提出了两种自适应划分方法。两个模拟应用程序的实验评估以及综合和实际数据证明了iPart对计算变化的响应和对分区的快速准确适应。
{"title":"Taming Computation Skews of Block-Oriented Iterative Scientific Applications in MapReduce Systems","authors":"Xin Yang, Min Li, Ze Yu, Xiaolin Li","doi":"10.1109/CLOUD.2014.33","DOIUrl":"https://doi.org/10.1109/CLOUD.2014.33","url":null,"abstract":"Nowadays, scientists are embracing big data techniques for exploring significant discoveries from large volumes of scientific data quickly. Properly partitioning workloads is essential for fully exploiting the benefit of parallelism, but is difficult for applications whose computations change iteratively. Computation skews are inevitable when executing block-oriented iterative scientific applications in MapReduce systems. This paper proposes iPart, an autonomic workload partitioning system for taming computation skews of block-oriented iterative scientific applications in MapReduce systems. iPart introduces a workload control loop into the conventional execution of MapReduce jobs. Workload estimates in terms of execution time are collected in the reduce phase and fed back to the partition phase to update partitioning plans. Computation skews are detected and addressed by adapting partitioning to computation changes iteratively. Two adaptive partitioning methods based on the binary partitioning method are presented. Experimental evaluations with two simulated applications and the synthetic and real-world data prove that iPart responds to computation changes and adapts partitioning quickly and accurately.","PeriodicalId":288542,"journal":{"name":"2014 IEEE 7th International Conference on Cloud Computing","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124140858","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing IaaS Reserved Contract Procurement Using Load Prediction 利用负荷预测优化IaaS保留合同采购
Pub Date : 2014-06-27 DOI: 10.1109/CLOUD.2014.22
R. Y. V. Bossche, K. Vanmechelen, J. Broeckhove
With the increased adoption of cloud computing, new challenges have emerged related to the cost-effective management of cloud resources. The proliferation of resource properties and pricing plans has made the selection, procurement and management of cloud resources a time-consuming and complex task, which stands to benefit from automation. This contribution focuses on the procurement decision of reserved contracts in the context of Infrastructure-as-a-Service (IaaS) providers such as Amazon EC2. Such reserved contracts complement pay-by-the-hour pricing models, and offer a significant reduction in price (up to 70%) for a particular period in return for an upfront payment. Thus, customers can reduce costs by predicting and analyzing their future needs in terms of the number and type of server instances. We present an algorithm that uses load prediction with automated time series forecasting based on a Double-seasonal Holt-Winters model, in order to make cost-efficient purchasing decisions among a wide range of contract types while taking into account an organization's current contract portfolio. We analyze its cost effectiveness through simulation of real-world web traffic traces. Our analysis investigates the impact of different prediction techniques on cost compared to a clairvoyant predictor and compares the algorithm's performance with a stationary contract renewal approach. Our results show that the algorithm is able to significantly reduce IaaS resource costs through automated reserved contract procurement. Moreover, the algorithm's computational cost makes it applicable to large-scale real-world settings.
随着越来越多地采用云计算,出现了与云资源的成本效益管理相关的新挑战。资源属性和定价计划的激增使得云资源的选择、采购和管理成为一项耗时且复杂的任务,这将从自动化中受益。该贡献主要关注基础设施即服务(IaaS)提供商(如Amazon EC2)上下文中保留合同的采购决策。这种保留合同补充了按小时付费的定价模式,并在特定时期提供大幅降价(最高70%),以换取预付款。因此,客户可以根据服务器实例的数量和类型来预测和分析他们未来的需求,从而降低成本。我们提出了一种基于双季节霍尔特-温特斯模型的负荷预测和自动时间序列预测的算法,以便在考虑组织当前合同组合的同时,在广泛的合同类型中做出具有成本效益的采购决策。我们通过模拟真实世界的网络流量轨迹来分析其成本效益。我们的分析调查了与千里眼预测器相比,不同预测技术对成本的影响,并将算法的性能与固定合同续订方法进行了比较。我们的研究结果表明,该算法能够通过自动保留合同采购显著降低IaaS资源成本。此外,该算法的计算成本使其适用于大规模的现实世界设置。
{"title":"Optimizing IaaS Reserved Contract Procurement Using Load Prediction","authors":"R. Y. V. Bossche, K. Vanmechelen, J. Broeckhove","doi":"10.1109/CLOUD.2014.22","DOIUrl":"https://doi.org/10.1109/CLOUD.2014.22","url":null,"abstract":"With the increased adoption of cloud computing, new challenges have emerged related to the cost-effective management of cloud resources. The proliferation of resource properties and pricing plans has made the selection, procurement and management of cloud resources a time-consuming and complex task, which stands to benefit from automation. This contribution focuses on the procurement decision of reserved contracts in the context of Infrastructure-as-a-Service (IaaS) providers such as Amazon EC2. Such reserved contracts complement pay-by-the-hour pricing models, and offer a significant reduction in price (up to 70%) for a particular period in return for an upfront payment. Thus, customers can reduce costs by predicting and analyzing their future needs in terms of the number and type of server instances. We present an algorithm that uses load prediction with automated time series forecasting based on a Double-seasonal Holt-Winters model, in order to make cost-efficient purchasing decisions among a wide range of contract types while taking into account an organization's current contract portfolio. We analyze its cost effectiveness through simulation of real-world web traffic traces. Our analysis investigates the impact of different prediction techniques on cost compared to a clairvoyant predictor and compares the algorithm's performance with a stationary contract renewal approach. Our results show that the algorithm is able to significantly reduce IaaS resource costs through automated reserved contract procurement. Moreover, the algorithm's computational cost makes it applicable to large-scale real-world settings.","PeriodicalId":288542,"journal":{"name":"2014 IEEE 7th International Conference on Cloud Computing","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123441140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Elasticity Management in Private and Hybrid Clouds 私有云和混合云中弹性管理
Pub Date : 2014-06-27 DOI: 10.1109/CLOUD.2014.110
Rhodney Simões, C. Kamienski
Cloud computing requires elasticity management for dynamically allocating and releasing resources. Even though the adoption of cloud services has been growing, there is little knowledge available for guiding users when they need to manage elasticity. This paper analyzes elasticity in private and hybrid clouds, using a university lab, PlanetLab and Amazon EC2. Results show that the choice of metrics and thresholds plays a key role in meeting performance levels and controlling costs and that cloudburst can be effectively used for a hybrid cloud but the choice of the type of virtual machine in the provider has a significant impact.
云计算需要弹性管理来动态分配和释放资源。尽管云服务的采用一直在增长,但在需要管理弹性时,指导用户的知识却很少。本文以一个大学实验室、PlanetLab和Amazon EC2为例,分析了私有云和混合云中的弹性。结果表明,度量标准和阈值的选择在满足性能水平和控制成本方面起着关键作用,cloudburst可以有效地用于混合云,但提供商中虚拟机类型的选择具有重大影响。
{"title":"Elasticity Management in Private and Hybrid Clouds","authors":"Rhodney Simões, C. Kamienski","doi":"10.1109/CLOUD.2014.110","DOIUrl":"https://doi.org/10.1109/CLOUD.2014.110","url":null,"abstract":"Cloud computing requires elasticity management for dynamically allocating and releasing resources. Even though the adoption of cloud services has been growing, there is little knowledge available for guiding users when they need to manage elasticity. This paper analyzes elasticity in private and hybrid clouds, using a university lab, PlanetLab and Amazon EC2. Results show that the choice of metrics and thresholds plays a key role in meeting performance levels and controlling costs and that cloudburst can be effectively used for a hybrid cloud but the choice of the type of virtual machine in the provider has a significant impact.","PeriodicalId":288542,"journal":{"name":"2014 IEEE 7th International Conference on Cloud Computing","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121925580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
HaSTE: Hadoop YARN Scheduling Based on Task-Dependency and Resource-Demand 基于任务依赖和资源需求的Hadoop YARN调度
Pub Date : 2014-06-27 DOI: 10.1109/CLOUD.2014.34
Yi Yao, Jiayin Wang, B. Sheng, Jason H. Lin, N. Mi
The MapReduce framework has become the de facto scheme for scalable semi-structured and un-structured data processing in recent years. The Hadoop ecosystem has evolved into its second generation, Hadoop YARN, which adopts fine-grained resource management schemes for job scheduling. One of the primary performance concerns in YARN is how to minimize the total completion length, i.e., makespan, of a set of MapReduce jobs. However, the precedence constraint or fairness constraint in current widely used scheduling policies in YARN, such as FIFO and Fair, can both lead to inefficient resource allocation in the Hadoop YARN cluster. They also omit the dependency between tasks which is crucial for the efficiency of resource utilization. We thus propose a new YARN scheduler, named HaSTE, which can effectively reduce the makespan of MapReduce jobs in YARN by leveraging the information of requested resources, resource capacities, and dependency between tasks. We implemented HaSTE as a pluggable scheduler in the most recent version of Hadoop YARN, and evaluated it with classic MapReduce benchmarks. The experimental results demonstrate that our YARN scheduler effectively reduces the makespans and improves resource utilization compare to the current scheduling policies.
近年来,MapReduce框架已经成为可扩展的半结构化和非结构化数据处理的实际方案。Hadoop生态系统已经发展到第二代,Hadoop YARN,它采用细粒度的资源管理方案来进行作业调度。YARN的主要性能问题之一是如何最小化一组MapReduce作业的总完成长度,即makespan。然而,目前YARN中广泛使用的调度策略,如FIFO和Fair,其优先级约束或公平性约束都会导致Hadoop YARN集群的资源分配效率低下。它们还忽略了对资源利用效率至关重要的任务之间的依赖关系。因此,我们提出了一个新的YARN调度器,命名为HaSTE,它可以通过利用请求资源、资源容量和任务之间的依赖关系信息,有效地减少YARN中MapReduce作业的makespan。我们在最新版本的Hadoop YARN中实现了一个可插拔的调度程序,并使用经典的MapReduce基准测试对其进行了评估。实验结果表明,与现有的调度策略相比,YARN调度策略有效地降低了makespans,提高了资源利用率。
{"title":"HaSTE: Hadoop YARN Scheduling Based on Task-Dependency and Resource-Demand","authors":"Yi Yao, Jiayin Wang, B. Sheng, Jason H. Lin, N. Mi","doi":"10.1109/CLOUD.2014.34","DOIUrl":"https://doi.org/10.1109/CLOUD.2014.34","url":null,"abstract":"The MapReduce framework has become the de facto scheme for scalable semi-structured and un-structured data processing in recent years. The Hadoop ecosystem has evolved into its second generation, Hadoop YARN, which adopts fine-grained resource management schemes for job scheduling. One of the primary performance concerns in YARN is how to minimize the total completion length, i.e., makespan, of a set of MapReduce jobs. However, the precedence constraint or fairness constraint in current widely used scheduling policies in YARN, such as FIFO and Fair, can both lead to inefficient resource allocation in the Hadoop YARN cluster. They also omit the dependency between tasks which is crucial for the efficiency of resource utilization. We thus propose a new YARN scheduler, named HaSTE, which can effectively reduce the makespan of MapReduce jobs in YARN by leveraging the information of requested resources, resource capacities, and dependency between tasks. We implemented HaSTE as a pluggable scheduler in the most recent version of Hadoop YARN, and evaluated it with classic MapReduce benchmarks. The experimental results demonstrate that our YARN scheduler effectively reduces the makespans and improves resource utilization compare to the current scheduling policies.","PeriodicalId":288542,"journal":{"name":"2014 IEEE 7th International Conference on Cloud Computing","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2014-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124808480","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 63
期刊
2014 IEEE 7th International Conference on Cloud Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1