Reliable Management of Virtualized Resources Using Fault Trees
A. Butoi, Alexandru-Ioan Stan, G. Silaghi
2014 16th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, 2014-09-01
DOI: 10.1109/SYNASC.2014.49 (https://doi.org/10.1109/SYNASC.2014.49)
Citations: 2
Abstract
New trends in distributed computing have changed the way we compute, both in cloud infrastructures and in high-performance computing. Resource virtualization technologies have enabled elastic resource provisioning and management through easy replication of virtual nodes and virtual machine migration. To provide high availability and reliability in such distributed environments, where resources are managed and served in the form of virtual machines, specific load-balancing and fault-management strategies are needed. Based on fault tree analysis concepts, we propose a distributed and autonomous approach to managing faults, using fault agents able to assess and predict, for each virtualized node, its current or future fault state. Accordingly, each node can decide whether to accept future jobs, delegate jobs to its own replicated instances, or start a live migration process as a second strategy for assuring availability and continuity of the service.
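
The abstract does not give the agents' actual model, so the following is only a minimal sketch of how a per-node fault agent might evaluate a fault tree and choose among the three actions mentioned above. Everything in it is an assumption made for illustration: the names `BasicEvent`, `Gate`, `fault_probability`, and `decide`, the example events and their probabilities, the 0.2 acceptance threshold, and the simplification that basic events are statistically independent (standard for textbook AND/OR gate evaluation, but not confirmed by the source).

```python
from dataclasses import dataclass, field
from typing import List, Union


@dataclass
class BasicEvent:
    """Leaf of the fault tree: one elementary fault with an estimated probability."""
    name: str
    probability: float


@dataclass
class Gate:
    """Inner node of the fault tree; kind is "AND" or "OR"."""
    kind: str
    children: List[Union["Gate", BasicEvent]] = field(default_factory=list)


def fault_probability(node: Union[Gate, BasicEvent]) -> float:
    """Probability of the event at `node`, assuming independent basic events."""
    if isinstance(node, BasicEvent):
        return node.probability
    probs = [fault_probability(child) for child in node.children]
    if node.kind == "AND":
        # All children must fail: multiply the probabilities.
        result = 1.0
        for p in probs:
            result *= p
        return result
    if node.kind == "OR":
        # At least one child fails: 1 minus the probability that none fails.
        none_fail = 1.0
        for p in probs:
            none_fail *= 1.0 - p
        return 1.0 - none_fail
    raise ValueError(f"unknown gate kind: {node.kind!r}")


def decide(p_fault: float, has_replica: bool, accept_below: float = 0.2) -> str:
    """Hypothetical decision rule; the threshold is illustrative, not from the paper."""
    if p_fault < accept_below:
        return "accept"      # node is healthy enough to take future jobs
    if has_replica:
        return "delegate"    # first strategy: hand jobs to a replicated instance
    return "migrate"         # second strategy: start a live migration


# Illustrative tree: the node fails if the disk fails, OR if CPU and memory
# are both exhausted. The probabilities are made-up inputs.
node_failure = Gate("OR", [
    Gate("AND", [BasicEvent("cpu_exhausted", 0.5),
                 BasicEvent("memory_exhausted", 0.4)]),
    BasicEvent("disk_failure", 0.1),
])

p = fault_probability(node_failure)
action = decide(p, has_replica=True)
```

With these made-up inputs the AND gate evaluates to 0.5 × 0.4 = 0.2 and the top-level OR gate to 1 − (0.8 × 0.9) = 0.28, which exceeds the illustrative threshold, so a node with a replica would delegate rather than accept new jobs.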