{"title":"Optimal Resource Provisioning Approach based on Cost Modeling for Spark Applications in Public Clouds","authors":"Jianfei Ruan, Q. Zheng, B. Dong","doi":"10.1145/2843966.2843972","DOIUrl":null,"url":null,"abstract":"Efficient resource provisioning is required when running Spark applications in public clouds. However, how to optimize resource provisioning to minimize the time and/or monetary cost for a specific application remains an intractable problem since resource provisioning may differ from application to application and even be affected by the amount of input data. Existing resource settings heavily rely on random selection or previous deployer experience, frequently leading to low-quality resource provisioning. Therefore, there is an urgent need to propose an approach towards optimal resource provisioning for Spark applications in public clouds. This is a PhD proposal, where an approach based on time and monetary cost modeling is presented for cloud resource provisioning optimization under two typical constrained scenarios. The approach systematically drives resource provisioning for a specific Spark application, which may save a significant amount of time and money, compared to randomly selected settings.","PeriodicalId":224203,"journal":{"name":"Proceedings of the Doctoral Symposium of the 16th International Middleware Conference","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Doctoral Symposium of the 16th International Middleware Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2843966.2843972","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Efficient resource provisioning is required when running Spark applications in public clouds. However, how to optimize resource provisioning to minimize the time and/or monetary cost for a specific application remains an intractable problem since resource provisioning may differ from application to application and even be affected by the amount of input data. Existing resource settings heavily rely on random selection or previous deployer experience, frequently leading to low-quality resource provisioning. Therefore, there is an urgent need to propose an approach towards optimal resource provisioning for Spark applications in public clouds. This is a PhD proposal, where an approach based on time and monetary cost modeling is presented for cloud resource provisioning optimization under two typical constrained scenarios. The approach systematically drives resource provisioning for a specific Spark application, which may save a significant amount of time and money, compared to randomly selected settings.