{"title":"Learning to self-recover","authors":"Thomas Reidemeister, Miao Jiang, Paul A. S. Ward","doi":"10.1109/INM.2011.5990506","DOIUrl":null,"url":null,"abstract":"Business success is contingent on dependable, yet affordable, software systems; this implies a need for self-recovering cloud-based component software systems. In prior work we demonstrated a discrete controller that allows scheduling of recovery actions based on uncertain fault knowledge. That approach required detailed analysis of historic failure data. In this paper we examine adaptive learning through active exploration and demonstrate the impact of drifting or invalid knowledge about recovery actions.","PeriodicalId":433520,"journal":{"name":"12th IFIP/IEEE International Symposium on Integrated Network Management (IM 2011) and Workshops","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"12th IFIP/IEEE International Symposium on Integrated Network Management (IM 2011) and Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INM.2011.5990506","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Business success is contingent on dependable, yet affordable, software systems; this implies a need for self-recovering cloud-based component software systems. In prior work we demonstrated a discrete controller that allows scheduling of recovery actions based on uncertain fault knowledge. That approach required detailed analysis of historic failure data. In this paper we examine adaptive learning through active exploration and demonstrate the impact of drifting or invalid knowledge about recovery actions.