Reinforcement Learning based Scheduling for Spark Jobs in Cloud Environment

Vishnu Prasad Verma, Nenavath Srinivas Naik, Santosh Kumar
{"title":"Reinforcement Learning based Scheduling for Spark Jobs in Cloud Environment","authors":"Vishnu Prasad Verma, Nenavath Srinivas Naik, Santosh Kumar","doi":"10.1109/UPCON56432.2022.9986440","DOIUrl":null,"url":null,"abstract":"Recently, big data computing paradigm has been gaining proliferation due to wide applications for processing enormous volumes of data to produce meaningful information. The big data computing frameworks perform data processing in cloud computing or physical on-premises. Cloud service providers provide flexible, affordable, and reliable resources that are easier to manage than on-premise physical data centers. So many organization are now moving their big data computing framework over to the cloud computing environment. However, due to several limitations, including the need to reduce costs for using virtual machines, optimize system performance by lowering the Average job completion time, and adhere to service level agreements for the jobs, scheduling Spark jobs efficiently in a cloud environment is a challenging problem. Numerous heuristic-based solutions are available in the literature; however, they do not work well in heterogeneous cloud environments where many constraints are present while scheduling the jobs. So, in this paper, we have optimized the use of computing resources in a cloud environment by analyzing spark job scheduling based on reinforcement learning algorithms. The case study's proposed analysis demonstrates how a reinforcement learning algorithm enables an agent to learn the inherent properties of the computing environment for job scheduling.","PeriodicalId":185782,"journal":{"name":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","volume":"149 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 9th Uttar Pradesh Section International Conference on Electrical, Electronics and Computer Engineering (UPCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/UPCON56432.2022.9986440","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Recently, big data computing paradigm has been gaining proliferation due to wide applications for processing enormous volumes of data to produce meaningful information. The big data computing frameworks perform data processing in cloud computing or physical on-premises. Cloud service providers provide flexible, affordable, and reliable resources that are easier to manage than on-premise physical data centers. So many organization are now moving their big data computing framework over to the cloud computing environment. However, due to several limitations, including the need to reduce costs for using virtual machines, optimize system performance by lowering the Average job completion time, and adhere to service level agreements for the jobs, scheduling Spark jobs efficiently in a cloud environment is a challenging problem. Numerous heuristic-based solutions are available in the literature; however, they do not work well in heterogeneous cloud environments where many constraints are present while scheduling the jobs. So, in this paper, we have optimized the use of computing resources in a cloud environment by analyzing spark job scheduling based on reinforcement learning algorithms. The case study's proposed analysis demonstrates how a reinforcement learning algorithm enables an agent to learn the inherent properties of the computing environment for job scheduling.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
云环境下基于强化学习的Spark作业调度
近年来,由于处理大量数据以产生有意义的信息的广泛应用,大数据计算范式得到了蓬勃发展。大数据计算框架在云计算或物理本地进行数据处理。云服务提供商提供灵活、经济、可靠的资源,比内部部署的物理数据中心更容易管理。因此,许多组织现在正在将他们的大数据计算框架转移到云计算环境。然而,由于一些限制,包括需要降低使用虚拟机的成本,通过降低平均作业完成时间来优化系统性能,以及遵守作业的服务水平协议,因此在云环境中有效地调度Spark作业是一个具有挑战性的问题。文献中有许多基于启发式的解决方案;但是,它们在异构云环境中不能很好地工作,因为在调度作业时存在许多约束。因此,在本文中,我们通过分析基于强化学习算法的spark作业调度,优化了云环境下计算资源的使用。案例研究提出的分析演示了强化学习算法如何使代理能够学习作业调度计算环境的固有属性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Mains Interface Circuit Design for Traveling Wave Tube Amplifier A Passive Technique for Detecting Islanding Using Voltage Sequence Component A Unified Framework for Covariance Adaptation with Multiple Source Domains Advance Sensor for Monitoring Electrolyte Leakage in Lithium-ion Batteries for Electric Vehicles A comparative study of survey papers based on energy efficient, coverage-aware, and fault tolerant in static sink node of WSN
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1