Failure-Aware Virtual Machine Configuration for Cloud Computing
Authors: Yaqin Luo, Li Qi
Venue: 2012 IEEE Asia-Pacific Services Computing Conference
Published: 2012-12-06
DOI: 10.1109/APSCC.2012.46 (https://doi.org/10.1109/APSCC.2012.46)
Citation count: 1
Abstract
Failures and their impact on system performance have become an increasingly important concern in cloud computing. Most techniques in today's systems are reactive: they recover only after a failure has occurred, which can incur major cost and significantly degrade system performance. Instead, we propose a proactive, failure-aware virtual machine infrastructure for cloud computing. Our approach takes both the performance and the reliability status of a node into account to forecast failures within a given time window. We leverage failure prediction techniques to mitigate the potential impact of failures on system reliability and productivity. The mechanism can also reschedule a running job if a failure occurs during execution. Experimental results show that the proposed strategy significantly improves system productivity and reliability.