{"title":"A high-level process checkpointing and migration scheme for heterogeneous distributed systems","authors":"T. Redhead","doi":"10.1109/ICDP.1996.864208","DOIUrl":null,"url":null,"abstract":"Reliability is a key concern of designers of distributed computing systems. Checkpointing can be used as a basis for designing resilient processes and process-migration schemes, but very few systems which implement process-checkpointing are heterogeneous. 'High-level' process checkpointing schemes capture process-state at a higher level of abstraction than do low-level schemes. The resulting state does not depend on low-level or platform-specific structures, and so is meaningful at any site in a heterogeneous distributed computing network. This paper presents a high-level approach to process checkpointing which is transparent to the programmer, which operates at a fine level of granularity, and which can deal with dynamically allocated memory and multithreaded processes.","PeriodicalId":127207,"journal":{"name":"Proceedings of IFIP/IEEE International Conference on Distributed Platforms","volume":"62 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of IFIP/IEEE International Conference on Distributed Platforms","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDP.1996.864208","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Reliability is a key concern of designers of distributed computing systems. Checkpointing can be used as a basis for designing resilient processes and process-migration schemes, but very few systems which implement process-checkpointing are heterogeneous. 'High-level' process checkpointing schemes capture process-state at a higher level of abstraction than do low-level schemes. The resulting state does not depend on low-level or platform-specific structures, and so is meaningful at any site in a heterogeneous distributed computing network. This paper presents a high-level approach to process checkpointing which is transparent to the programmer, which operates at a fine level of granularity, and which can deal with dynamically allocated memory and multithreaded processes.