Tzu-Chi Huang, Kuo-Chih Chu, Jiahuei Lin, Guo-Hao Huang, C. Shieh
{"title":"Workload Alleviation Scheduling Framework to Alleviate Negative Performance Impact of Intermediate Data Skew in Small-Scale MapReduce Cloud","authors":"Tzu-Chi Huang, Kuo-Chih Chu, Jiahuei Lin, Guo-Hao Huang, C. Shieh","doi":"10.1109/ICSSE.2018.8520003","DOIUrl":null,"url":null,"abstract":"A MapReduce cloud becomes the essential platform in the cloud computing infrastructure today. Because applications may process input data with different algorithms and logics to produce intermediate data, a MapReduce cloud may suffer intermediate data skew by unevenly distributing intermediate data among nodes at run time. When intermediate data skew happens, a MapReduce cloud not only idles nodes to waste computation resources but also prolongs the application execution progress to hurt user experiences in cloud computing. Instead of the existing solutions that assume many available idle nodes and use computation resources in a loose way, a MapReduce cloud can use the Workload Alleviation Scheduling Framework (W ASF) proposed in this paper to alleviate the negative performance impact of intermediate data skew in a small-scale MapReduce cloud by smartly utilizing computation resources. Besides, a MapReduce cloud is verified with popular applications in experiments to have the outstanding performance improvement with W ASF when intermediate data skew happens.","PeriodicalId":431387,"journal":{"name":"2018 International Conference on System Science and Engineering (ICSSE)","volume":"2014 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on System Science and Engineering (ICSSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSSE.2018.8520003","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
A MapReduce cloud becomes the essential platform in the cloud computing infrastructure today. Because applications may process input data with different algorithms and logics to produce intermediate data, a MapReduce cloud may suffer intermediate data skew by unevenly distributing intermediate data among nodes at run time. When intermediate data skew happens, a MapReduce cloud not only idles nodes to waste computation resources but also prolongs the application execution progress to hurt user experiences in cloud computing. Instead of the existing solutions that assume many available idle nodes and use computation resources in a loose way, a MapReduce cloud can use the Workload Alleviation Scheduling Framework (W ASF) proposed in this paper to alleviate the negative performance impact of intermediate data skew in a small-scale MapReduce cloud by smartly utilizing computation resources. Besides, a MapReduce cloud is verified with popular applications in experiments to have the outstanding performance improvement with W ASF when intermediate data skew happens.