恢复机制对保存损坏状态的可能性的影响

13th International Symposium on Software Reliability Engineering, 2002. Proceedings. Pub Date : 2002-11-12 DOI:10.1109/ISSRE.2002.1173219

Subhachandra Chandra, Peter M. Chen

{"title":"恢复机制对保存损坏状态的可能性的影响","authors":"Subhachandra Chandra, Peter M. Chen","doi":"10.1109/ISSRE.2002.1173219","DOIUrl":null,"url":null,"abstract":"Recovery systems must save state before a failure occurs to enable the system to recover from the failure. However, recovery will fail if the recovery system saves any state corrupted by the fault. The frequency and comprehensiveness of how a recovery system saves state has a major effect on how often the recovery system inadvertently saves corrupted state. This paper explores and measures that effect. We measure how often software faults in the application and operating system cause real applications to save corrupted state when using different types of recovery systems. We find that generic recovery techniques, such as checkpointing and logging, work well for faults in the operating system. However, we find that they do not work well for faults in the application because the very actions taken to enable recovery often corrupt the state upon which successful recovery depends.","PeriodicalId":159160,"journal":{"name":"13th International Symposium on Software Reliability Engineering, 2002. Proceedings.","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"25","resultStr":"{\"title\":\"The impact of recovery mechanisms on the likelihood of saving corrupted state\",\"authors\":\"Subhachandra Chandra, Peter M. Chen\",\"doi\":\"10.1109/ISSRE.2002.1173219\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recovery systems must save state before a failure occurs to enable the system to recover from the failure. However, recovery will fail if the recovery system saves any state corrupted by the fault. The frequency and comprehensiveness of how a recovery system saves state has a major effect on how often the recovery system inadvertently saves corrupted state. This paper explores and measures that effect. We measure how often software faults in the application and operating system cause real applications to save corrupted state when using different types of recovery systems. We find that generic recovery techniques, such as checkpointing and logging, work well for faults in the operating system. However, we find that they do not work well for faults in the application because the very actions taken to enable recovery often corrupt the state upon which successful recovery depends.\",\"PeriodicalId\":159160,\"journal\":{\"name\":\"13th International Symposium on Software Reliability Engineering, 2002. Proceedings.\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-11-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"25\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"13th International Symposium on Software Reliability Engineering, 2002. Proceedings.\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISSRE.2002.1173219\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"13th International Symposium on Software Reliability Engineering, 2002. Proceedings.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSRE.2002.1173219","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 25

摘要

恢复系统必须在故障发生之前保存状态，以使系统能够从故障中恢复。但是，如果恢复系统保存任何被故障损坏的状态，则恢复将失败。恢复系统保存状态的频率和全面性对恢复系统无意中保存损坏状态的频率有很大影响。本文对这种影响进行了探讨和测量。当使用不同类型的恢复系统时，我们测量应用程序和操作系统中的软件故障导致实际应用程序保存损坏状态的频率。我们发现，一般的恢复技术，如检查点和日志记录，对操作系统中的故障很有效。然而，我们发现它们不能很好地处理应用程序中的错误，因为为启用恢复而采取的操作通常会破坏成功恢复所依赖的状态。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

The impact of recovery mechanisms on the likelihood of saving corrupted state

Recovery systems must save state before a failure occurs to enable the system to recover from the failure. However, recovery will fail if the recovery system saves any state corrupted by the fault. The frequency and comprehensiveness of how a recovery system saves state has a major effect on how often the recovery system inadvertently saves corrupted state. This paper explores and measures that effect. We measure how often software faults in the application and operating system cause real applications to save corrupted state when using different types of recovery systems. We find that generic recovery techniques, such as checkpointing and logging, work well for faults in the operating system. However, we find that they do not work well for faults in the application because the very actions taken to enable recovery often corrupt the state upon which successful recovery depends.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

13th International Symposium on Software Reliability Engineering, 2002. Proceedings.

自引率

0.00%

发文量

期刊最新文献

Test reuse in the spreadsheet paradigm Heterogeneous software reliability modeling Toward a quantifiable definition of software faults Emulation of software faults by educated mutations at machine-code level Metrics for measuring the effectiveness of software-testing tools