Adaptive Scheduling on Power-Aware Managed Data-Centers Using Machine Learning

2011 IEEE/ACM 12th International Conference on Grid Computing Pub Date : 2011-09-21 DOI:10.1109/GRID.2011.18

J. L. Berral, Ricard Gavaldà, J. Torres

{"title":"Adaptive Scheduling on Power-Aware Managed Data-Centers Using Machine Learning","authors":"J. L. Berral, Ricard Gavaldà, J. Torres","doi":"10.1109/GRID.2011.18","DOIUrl":null,"url":null,"abstract":"Energy-related costs have become one of the major economic factors in IT data-centers, and companies and the research community are currently working on new efficient power-aware resource management strategies, also known as \"Green IT\". Here we propose a framework for autonomic scheduling of tasks and web-services on cloud environments, optimizing the profit taking into account revenue for task execution minus penalties for service-level agreement violations, minus power consumption cost. The principal contribution is the combination of consolidation and virtualization technologies, mathematical optimization methods, and machine learning techniques. The data-center infrastructure, tasks to execute, and desired profit are casted as a mathematical programming model, which can then be solved in different ways to find good task scheduling. We use an exact solver based on mixed linear programming as a proof of concept but, since it is an NP-complete problem, we show that approximate solvers provide valid alternatives for finding approximately optimal schedules. The machine learning is used to estimate the initially unknown parameters of the mathematical model. In particular, we need to predict a priori resource usage (such as CPU consumption) by different tasks under current workloads, and estimate task service-level-agreement (such as response time) given workload features, host characteristics, and contention among tasks in the same host. Experiments show that machine learning algorithms can predict system behavior with acceptable accuracy, and that their combination with the exact or approximate schedulers manages to allocate tasks to hosts striking a balance between revenue for executed tasks, quality of service, and power consumption.","PeriodicalId":308086,"journal":{"name":"2011 IEEE/ACM 12th International Conference on Grid Computing","volume":"123 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"60","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE/ACM 12th International Conference on Grid Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/GRID.2011.18","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 60

Abstract

Energy-related costs have become one of the major economic factors in IT data-centers, and companies and the research community are currently working on new efficient power-aware resource management strategies, also known as "Green IT". Here we propose a framework for autonomic scheduling of tasks and web-services on cloud environments, optimizing the profit taking into account revenue for task execution minus penalties for service-level agreement violations, minus power consumption cost. The principal contribution is the combination of consolidation and virtualization technologies, mathematical optimization methods, and machine learning techniques. The data-center infrastructure, tasks to execute, and desired profit are casted as a mathematical programming model, which can then be solved in different ways to find good task scheduling. We use an exact solver based on mixed linear programming as a proof of concept but, since it is an NP-complete problem, we show that approximate solvers provide valid alternatives for finding approximately optimal schedules. The machine learning is used to estimate the initially unknown parameters of the mathematical model. In particular, we need to predict a priori resource usage (such as CPU consumption) by different tasks under current workloads, and estimate task service-level-agreement (such as response time) given workload features, host characteristics, and contention among tasks in the same host. Experiments show that machine learning algorithms can predict system behavior with acceptable accuracy, and that their combination with the exact or approximate schedulers manages to allocate tasks to hosts striking a balance between revenue for executed tasks, quality of service, and power consumption.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于机器学习的电力感知管理数据中心的自适应调度

与能源相关的成本已经成为IT数据中心的主要经济因素之一，公司和研究团体目前正在研究新的高效的能源感知资源管理策略，也被称为“绿色IT”。在这里，我们提出了一个框架，用于在云环境中自动调度任务和web服务，将任务执行的收入减去违反服务水平协议的处罚，减去功耗成本，从而优化利润。主要的贡献是整合和虚拟化技术、数学优化方法和机器学习技术的结合。数据中心基础设施、要执行的任务和期望的利润被转换成一个数学规划模型，然后可以用不同的方法对其进行求解，以找到良好的任务调度。我们使用基于混合线性规划的精确解算器作为概念证明，但是，由于它是一个np完全问题，我们表明近似解算器为寻找近似最优调度提供了有效的替代方案。机器学习用于估计数学模型的初始未知参数。特别是，我们需要预测当前工作负载下不同任务的先验资源使用情况(如CPU消耗)，并在给定工作负载特征、主机特征和同一主机中任务之间的争用的情况下估计任务服务水平协议(如响应时间)。实验表明，机器学习算法可以以可接受的精度预测系统行为，并且它们与精确或近似调度器的组合可以将任务分配给主机，从而在执行任务的收益、服务质量和功耗之间取得平衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2011 IEEE/ACM 12th International Conference on Grid Computing

自引率

0.00%

发文量

期刊最新文献

Replicated Grid Resources HisT/PLIER: A Two-Fold Provenance Approach for Grid-Enabled Scientific Workflows Using WS-VLAM Using the Gfarm File System as a POSIX Compatible Storage Platform for Hadoop MapReduce Applications MARIANE: MApReduce Implementation Adapted for HPC Environments Improved Grid Security Posture through Multi-factor Authentication