Reinforcement Learning based Scheduling for Spark Jobs in Cloud Environment

2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON) Pub Date : 2022-12-02 DOI:10.1109/UPCON56432.2022.9986440

Vishnu Prasad Verma, Nenavath Srinivas Naik, Santosh Kumar

{"title":"Reinforcement Learning based Scheduling for Spark Jobs in Cloud Environment","authors":"Vishnu Prasad Verma, Nenavath Srinivas Naik, Santosh Kumar","doi":"10.1109/UPCON56432.2022.9986440","DOIUrl":null,"url":null,"abstract":"Recently, big data computing paradigm has been gaining proliferation due to wide applications for processing enormous volumes of data to produce meaningful information. The big data computing frameworks perform data processing in cloud computing or physical on-premises. Cloud service providers provide flexible, affordable, and reliable resources that are easier to manage than on-premise physical data centers. So many organization are now moving their big data computing framework over to the cloud computing environment. However, due to several limitations, including the need to reduce costs for using virtual machines, optimize system performance by lowering the Average job completion time, and adhere to service level agreements for the jobs, scheduling Spark jobs efficiently in a cloud environment is a challenging problem. Numerous heuristic-based solutions are available in the literature; however, they do not work well in heterogeneous cloud environments where many constraints are present while scheduling the jobs. So, in this paper, we have optimized the use of computing resources in a cloud environment by analyzing spark job scheduling based on reinforcement learning algorithms. The case study's proposed analysis demonstrates how a reinforcement learning algorithm enables an agent to learn the inherent properties of the computing environment for job scheduling.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"149 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UPCON56432.2022.9986440","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Recently, big data computing paradigm has been gaining proliferation due to wide applications for processing enormous volumes of data to produce meaningful information. The big data computing frameworks perform data processing in cloud computing or physical on-premises. Cloud service providers provide flexible, affordable, and reliable resources that are easier to manage than on-premise physical data centers. So many organization are now moving their big data computing framework over to the cloud computing environment. However, due to several limitations, including the need to reduce costs for using virtual machines, optimize system performance by lowering the Average job completion time, and adhere to service level agreements for the jobs, scheduling Spark jobs efficiently in a cloud environment is a challenging problem. Numerous heuristic-based solutions are available in the literature; however, they do not work well in heterogeneous cloud environments where many constraints are present while scheduling the jobs. So, in this paper, we have optimized the use of computing resources in a cloud environment by analyzing spark job scheduling based on reinforcement learning algorithms. The case study's proposed analysis demonstrates how a reinforcement learning algorithm enables an agent to learn the inherent properties of the computing environment for job scheduling.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

云环境下基于强化学习的Spark作业调度

近年来，由于处理大量数据以产生有意义的信息的广泛应用，大数据计算范式得到了蓬勃发展。大数据计算框架在云计算或物理本地进行数据处理。云服务提供商提供灵活、经济、可靠的资源，比内部部署的物理数据中心更容易管理。因此，许多组织现在正在将他们的大数据计算框架转移到云计算环境。然而，由于一些限制，包括需要降低使用虚拟机的成本，通过降低平均作业完成时间来优化系统性能，以及遵守作业的服务水平协议，因此在云环境中有效地调度Spark作业是一个具有挑战性的问题。文献中有许多基于启发式的解决方案;但是，它们在异构云环境中不能很好地工作，因为在调度作业时存在许多约束。因此，在本文中，我们通过分析基于强化学习算法的spark作业调度，优化了云环境下计算资源的使用。案例研究提出的分析演示了强化学习算法如何使代理能够学习作业调度计算环境的固有属性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)

自引率

0.00%

发文量