Evaluation of Energy Consumption of Replicated Tasks in a Volunteer Computing Environment

Companion of the 2018 ACM/SPEC International Conference on Performance Engineering Pub Date : 2018-04-02 DOI:10.1145/3185768.3186313

A. McGough, M. Forshaw

{"title":"Evaluation of Energy Consumption of Replicated Tasks in a Volunteer Computing Environment","authors":"A. McGough, M. Forshaw","doi":"10.1145/3185768.3186313","DOIUrl":null,"url":null,"abstract":"High Throughput Computing allows workloads of many thousands of tasks to be performed efficiently over many distributed resources and frees the user from the laborious process of managing task deployment, execution and result collection. However, in many cases the High Throughput Computing system is comprised from volunteer computational resources where tasks may be evicted by the owner of the resource. This has two main disadvantages. First, tasks may take longer to run as they may require multiple deployments before finally obtaining enough time on a resource to complete. Second, the wasted computation time will lead to wasted energy. We may be able to reduce the effect of the first disadvantage here by submitting multiple replicas of the task and take the results from the first one to complete. This, though, could lead to a significant increase in energy consumption. Thus we desire to only ever submit the minimum number of replicas required to run the task in the allocated time whilst simultaneously minimising energy. In this work we evaluate the use of fixed replica counts and Reinforcement Learning on the proportion of task which fail to finish in a given time-frame and the energy consumed by the system.","PeriodicalId":10596,"journal":{"name":"Companion of the 2018 ACM/SPEC International Conference on Performance Engineering","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2018-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Companion of the 2018 ACM/SPEC International Conference on Performance Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3185768.3186313","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

High Throughput Computing allows workloads of many thousands of tasks to be performed efficiently over many distributed resources and frees the user from the laborious process of managing task deployment, execution and result collection. However, in many cases the High Throughput Computing system is comprised from volunteer computational resources where tasks may be evicted by the owner of the resource. This has two main disadvantages. First, tasks may take longer to run as they may require multiple deployments before finally obtaining enough time on a resource to complete. Second, the wasted computation time will lead to wasted energy. We may be able to reduce the effect of the first disadvantage here by submitting multiple replicas of the task and take the results from the first one to complete. This, though, could lead to a significant increase in energy consumption. Thus we desire to only ever submit the minimum number of replicas required to run the task in the allocated time whilst simultaneously minimising energy. In this work we evaluate the use of fixed replica counts and Reinforcement Learning on the proportion of task which fail to finish in a given time-frame and the energy consumed by the system.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

志愿计算环境下重复任务的能量消耗评价

高吞吐量计算允许在许多分布式资源上高效地执行数千个任务的工作负载，并将用户从管理任务部署、执行和结果收集的繁重过程中解放出来。然而，在许多情况下，高吞吐量计算系统由志愿计算资源组成，其中任务可能被资源所有者驱逐。这有两个主要缺点。首先，任务可能需要更长的时间来运行，因为它们可能需要多次部署才能最终在资源上获得足够的时间来完成。其次，浪费的计算时间会导致能源的浪费。我们可以通过提交任务的多个副本并从第一个副本获取结果来减少第一个缺点的影响。然而，这可能会导致能源消耗的显著增加。因此，我们希望只提交在分配的时间内运行任务所需的最小数量的副本，同时最大限度地减少能量。在这项工作中，我们评估了固定副本计数和强化学习在给定时间框架内无法完成的任务比例和系统消耗的能量的使用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Companion of the 2018 ACM/SPEC International Conference on Performance Engineering

自引率

0.00%

发文量