{"title":"ResilientVM: high performance virtual machine recovery in the cloud","authors":"V. Salapura, R. Harper","doi":"10.1145/2747470.2747472","DOIUrl":null,"url":null,"abstract":"In this paper, we present a scalable parallel virtual machine planning and failover method for high availability at a virtual machine (VM) level in a data center. The solution is implemented in IBM's Cloud Managed Services (CMS) enterprise cloud offering for rapid failover in large data centers with a large number of servers, VMs, and disks. The failover system enables failover-time planning and execution and keeps the recovery time within limits of service level agreement (SLA) allowed time budget. The initial serial failover time is reduced by a factor of up to 11 for parallel implementation, and by a factor of up to 44 for parallel failover - parallel storage mapping implementation.","PeriodicalId":328734,"journal":{"name":"AIMC '15","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"AIMC '15","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2747470.2747472","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this paper, we present a scalable parallel virtual machine planning and failover method for high availability at a virtual machine (VM) level in a data center. The solution is implemented in IBM's Cloud Managed Services (CMS) enterprise cloud offering for rapid failover in large data centers with a large number of servers, VMs, and disks. The failover system enables failover-time planning and execution and keeps the recovery time within limits of service level agreement (SLA) allowed time budget. The initial serial failover time is reduced by a factor of up to 11 for parallel implementation, and by a factor of up to 44 for parallel failover - parallel storage mapping implementation.