{"title":"MapReduce应用的高效调度框架","authors":"Nikos Zacheilas, V. Kalogeraki","doi":"10.1109/ICAC.2015.38","DOIUrl":null,"url":null,"abstract":"Real-time, cost-effective execution of \"Big Data\" applications on MapReduce clusters has been an important goal for many scientists in recent years. The MapReduce paradigm has been widely adopted by major computing companies as a powerful approach for large-scale data analytics. However, running MapReduce workloads in cluster environments has been particularly challenging due to the trade-offs that exist between the need for performance and the corresponding budget cost. Furthermore, the large number of resource configuration parameters exacerbates the problem, as users must manually tune the parameters without knowing their impact on the performance and budget costs. In this paper, we describe our approach to cost-effective scheduling of MapReduce applications. We present an overview of our framework that enables appropriate configuration of parameters to detect cost-efficient resource allocations. Our early experimental results illustrate the working and benefit of our approach.","PeriodicalId":6643,"journal":{"name":"2015 IEEE International Conference on Autonomic Computing","volume":"37 1","pages":"147-148"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"A Framework for Cost-Effective Scheduling of MapReduce Applications\",\"authors\":\"Nikos Zacheilas, V. Kalogeraki\",\"doi\":\"10.1109/ICAC.2015.38\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Real-time, cost-effective execution of \\\"Big Data\\\" applications on MapReduce clusters has been an important goal for many scientists in recent years. The MapReduce paradigm has been widely adopted by major computing companies as a powerful approach for large-scale data analytics. However, running MapReduce workloads in cluster environments has been particularly challenging due to the trade-offs that exist between the need for performance and the corresponding budget cost. Furthermore, the large number of resource configuration parameters exacerbates the problem, as users must manually tune the parameters without knowing their impact on the performance and budget costs. In this paper, we describe our approach to cost-effective scheduling of MapReduce applications. We present an overview of our framework that enables appropriate configuration of parameters to detect cost-efficient resource allocations. Our early experimental results illustrate the working and benefit of our approach.\",\"PeriodicalId\":6643,\"journal\":{\"name\":\"2015 IEEE International Conference on Autonomic Computing\",\"volume\":\"37 1\",\"pages\":\"147-148\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE International Conference on Autonomic Computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAC.2015.38\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Autonomic Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAC.2015.38","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Framework for Cost-Effective Scheduling of MapReduce Applications
Real-time, cost-effective execution of "Big Data" applications on MapReduce clusters has been an important goal for many scientists in recent years. The MapReduce paradigm has been widely adopted by major computing companies as a powerful approach for large-scale data analytics. However, running MapReduce workloads in cluster environments has been particularly challenging due to the trade-offs that exist between the need for performance and the corresponding budget cost. Furthermore, the large number of resource configuration parameters exacerbates the problem, as users must manually tune the parameters without knowing their impact on the performance and budget costs. In this paper, we describe our approach to cost-effective scheduling of MapReduce applications. We present an overview of our framework that enables appropriate configuration of parameters to detect cost-efficient resource allocations. Our early experimental results illustrate the working and benefit of our approach.