Recoverable distributed shared virtual memory: memory coherence and storage structures

[1989] The Nineteenth International Symposium on Fault-Tolerant Computing. Digest of Papers Pub Date : 1989-06-21 DOI:10.1109/FTCS.1989.105629

Kun-Lung Wu, W. Fuchs

引用次数: 13

Abstract

An examination is made of the problem of implementing rollback recovery in multicomputer distributed shared virtual memory environments, in which the shared memory is implemented in software and exists only virtually. A user-transparent checkpointing recovery scheme and a twin-page disk storage management are presented to implement a recoverable distributed shared virtual memory. The checkpointing scheme is integrated with the shared virtual memory management. The twin-page disk approach allows incremental checkpointing without an explicit 'undo' at the time of recovery. A single consistent checkpoint state is maintained on stable disk storage. The recoverable distributed shared virtual memory allows the system to restart computation from a previous checkpoint after a processor failure without a global restart.<>

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

可恢复分布式共享虚拟内存:内存一致性和存储结构

研究了在多计算机分布式共享虚拟内存环境中实现回滚恢复的问题，在这种环境中，共享内存是通过软件实现的，只是虚拟存在的。为了实现可恢复的分布式共享虚拟内存，提出了用户透明的检查点恢复方案和双页磁盘存储管理。检查点方案与共享虚拟内存管理相结合。双页磁盘方法允许增量检查点，而不需要在恢复时显式地“撤消”。在稳定的磁盘存储上保持单一一致的检查点状态。可恢复的分布式共享虚拟内存允许系统在处理器故障后从先前的检查点重新开始计算，而无需全局重新启动

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

[1989] The Nineteenth International Symposium on Fault-Tolerant Computing. Digest of Papers

自引率

0.00%

发文量

期刊最新文献

Replication within atomic actions and conversations: a case study in fault-tolerance duality Byte unidirectional error correcting codes F-T in telecommunications networks: state, perspectives, trends Evaluation of fault-tolerant systems with nonhomogeneous workloads Control-flow checking using watchdog assists and extended-precision checksums