Node covering, error correcting codes and multiprocessors with very high average fault tolerance

Twenty-Fifth International Symposium on Fault-Tolerant Computing. Digest of Papers Pub Date : 1995-06-27 DOI:10.1109/FTCS.1995.466967

S. Dutt, N. Mahapatra

引用次数: 27

Abstract

Most previous work on fault-tolerant (FT) multiprocessor design has concentrated on deterministic k-fault-tolerant (k-FT) designs in which exactly k spare processors and some spare switches and links are added to construct multiprocessors that can tolerate any k processor faults. However, after k faults are reconfigured around, much of the extra links and switches can remain unutilized. We show how to use the node-covering principle of Dutt and Hayes (1992) and error correcting codes in order to construct probabilistic designs with very high average fault tolerance but low wiring and switch overhead. This design methodology is applicable to any multiprocessor interconnection topology. We also obtain the deterministic fault tolerance for these designs and develop efficient layout strategies for them.<>

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

节点覆盖，纠错码和多处理器具有非常高的平均容错性

先前关于容错多处理器设计的大部分工作都集中在确定性k容错设计(k-FT)上，在这种设计中，精确地添加k个备用处理器和一些备用开关和链路来构建能够容忍任意k个处理器故障的多处理器。然而，在k个故障被重新配置之后，许多额外的链路和交换机可能仍然没有被利用。我们展示了如何使用Dutt和Hayes(1992)的节点覆盖原理和纠错码来构建具有非常高的平均容错性但低布线和开关开销的概率设计。这种设计方法适用于任何多处理器互连拓扑结构。我们还获得了这些设计的确定性容错性，并为它们制定了有效的布局策略

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Twenty-Fifth International Symposium on Fault-Tolerant Computing. Digest of Papers

自引率

0.00%

发文量

期刊最新文献

Design verification of a super-scalar RISC processor ARMOR: analyzer for reducing module operational risk Evaluation of software dependability based on stability test data Modeling and testing a critical fault-tolerant multi-process system Measuring robustness of a fault tolerant aerospace system