Daniel Rosendo, P. Endo, Guto Leoni Santos, D. Gomes, G. Gonçalves, A. Moreira, J. Kelner, D. Sadok, M. Mahloo
{"title":"Modeling and analyzing power system failures on cloud services","authors":"Daniel Rosendo, P. Endo, Guto Leoni Santos, D. Gomes, G. Gonçalves, A. Moreira, J. Kelner, D. Sadok, M. Mahloo","doi":"10.23919/CNSM.2017.8256034","DOIUrl":null,"url":null,"abstract":"Many enterprises rely on cloud infrastructure to host their critical applications (such as trading, banking transaction, airline reservation system, and credit card authorization). The unavailability of these applications may lead to severe consequences that go beyond the financial losses, reaching the cloud provider reputation too. However, to maintain high availability in a cloud data center is a difficult task due to its complexity. The power subsystem is crucial for the entire operation of the data center because it supplies power for all other subsystems, including IT components and cooling equipment. Some studies have already proposed models to evaluate the availability of the power subsystem, but none of them are based on standard redundancy models. Standards guide cloud providers regarding availability, points of failure, and watts per square foot based on components' redundancy. This paper proposes RBD and Petri Net models based on the TIA-942 standard to estimate the availability of the data center power subsystem and analyze how failures on power subsystem impact the availability of critical applications. These models are important to resource planning and decision making by the cloud providers, because they may identify which components they ought to invest in order to improve the availability level.","PeriodicalId":211611,"journal":{"name":"2017 13th International Conference on Network and Service Management (CNSM)","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 13th International Conference on Network and Service Management (CNSM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.23919/CNSM.2017.8256034","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Many enterprises rely on cloud infrastructure to host their critical applications (such as trading, banking transaction, airline reservation system, and credit card authorization). The unavailability of these applications may lead to severe consequences that go beyond the financial losses, reaching the cloud provider reputation too. However, to maintain high availability in a cloud data center is a difficult task due to its complexity. The power subsystem is crucial for the entire operation of the data center because it supplies power for all other subsystems, including IT components and cooling equipment. Some studies have already proposed models to evaluate the availability of the power subsystem, but none of them are based on standard redundancy models. Standards guide cloud providers regarding availability, points of failure, and watts per square foot based on components' redundancy. This paper proposes RBD and Petri Net models based on the TIA-942 standard to estimate the availability of the data center power subsystem and analyze how failures on power subsystem impact the availability of critical applications. These models are important to resource planning and decision making by the cloud providers, because they may identify which components they ought to invest in order to improve the availability level.