{"title":"Exploring the power of resource allocation for Spark executor","authors":"Huihong He, Yan Li, Yanfei Lv, Yong Wang","doi":"10.1109/ICSESS.2016.7883042","DOIUrl":null,"url":null,"abstract":"Nowadays Spark has been widely adopted as a sharp blade in solving big data problems by pipelining tasks of jobs on each node of cluster. In order to improve cluster resource utilization, lots of Spark performance-tuning advices have been proposed both by Spark and researchers. However, we notice that most of these advices focus tuning configuration items in isolation without considering job characteristics. In this paper, we try to explore the impact of executor quota allocation for Spark job in consideration of job stages and size of input. Dozens of carefully designed experiments reveal that execution time among job stages varies in probability as executor quota changes and thus the job execution time varies. We believe this conclusion helps to shed light on allocating executor resource quota regarding to job characteristics.","PeriodicalId":175933,"journal":{"name":"2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS)","volume":"264 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSESS.2016.7883042","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Nowadays Spark has been widely adopted as a sharp blade in solving big data problems by pipelining tasks of jobs on each node of cluster. In order to improve cluster resource utilization, lots of Spark performance-tuning advices have been proposed both by Spark and researchers. However, we notice that most of these advices focus tuning configuration items in isolation without considering job characteristics. In this paper, we try to explore the impact of executor quota allocation for Spark job in consideration of job stages and size of input. Dozens of carefully designed experiments reveal that execution time among job stages varies in probability as executor quota changes and thus the job execution time varies. We believe this conclusion helps to shed light on allocating executor resource quota regarding to job characteristics.