"The minimal test set for sorting networks and the use of sorting networks in self-testing checkers for unordered codes," S. Piestrak, in Digest of Papers, Fault-Tolerant Computing: 20th International Symposium (FTCS-20), 1990. doi: 10.1109/FTCS.1990.89382

It is shown that an n-input sorting network (SN) can be used to implement all n-variable symmetric threshold functions using the least amount of hardware. A procedure for generating the minimal test set for K.E. Batcher's SNs is presented. An upper bound is determined for the number of tests required to detect all stuck-at faults in an n-input SN; it is lower than for similar designs used to date. Finally, it is shown that SNs can be used to realize easily testable self-testing checkers (STCs) for m-out-of-2m codes and all J.M. Berger codes. The new STCs for m/2m codes (m > 3) have the lowest gate count and require the fewest tests. Upper bounds are also found for the number of tests required by the new STCs for Berger codes with I information bits; for I >= 14 they require fewer gates than similar designs known to date.
"A software based approach to achieving optimal performance for signature control flow checking," N. Warter, Wen-mei W. Hwu, in Digest of Papers, Fault-Tolerant Computing: 20th International Symposium (FTCS-20), 1990. doi: 10.1109/FTCS.1990.89399

The authors present a software-based approach that uses run-time program behavior to minimize the performance overhead of signature control flow checking. In general, for both RISC (reduced-instruction-set-computer) and CISC (complex-instruction-set-computer) architectures, it is found that using run-time information can reduce the performance overhead by 50%. For the MC68000, the performance overhead of adding justifying and reference signatures to the program code is approximately 2.8%. In addition to optimizing performance, the authors' approach does not increase the hardware complexity of the monitor. Furthermore, an O(N^2) algorithm is presented that inserts justifying signatures on the arcs of a program control flow graph with N nodes. It is shown that the algorithmic complexity of previous schemes, which insert justifying signatures in the program nodes, is exponential.
"A dependence graph-based approach to the design of algorithm-based fault tolerant systems," B. Vinnakota, N. Jha, in Digest of Papers, Fault-Tolerant Computing: 20th International Symposium (FTCS-20), 1990. doi: 10.1109/FTCS.1990.89347

A two-stage approach to the design of algorithm-based fault-tolerant (ABFT) systems is proposed. In the first stage a code is chosen to encode the data used in the algorithm. In the second stage the optimal architecture for implementing the scheme is chosen through the use of dependence graphs. Dependence graphs are a graph-theoretic form of algorithm representation. It is demonstrated that not all architectures are ideal for the implementation of a particular ABFT scheme. The authors propose new measures for characterizing the fault-tolerance capability of a system in order to better exploit the proposed design method. Dependence graphs can also be used for the synthesis of ABFT schemes for nonlinear problems. An example of a fault-tolerant median filter is provided to illustrate the usefulness of the dependence graph as a design tool for nonlinear system synthesis.
"Practical application and implementation of distributed system-level diagnosis theory," R. Bianchini, K. Goodwin, D. S. Nydick, in Digest of Papers, Fault-Tolerant Computing: 20th International Symposium (FTCS-20), 1990. doi: 10.1109/FTCS.1990.89380

A DSD (distributed self-diagnosing) project that consists of the implementation of a distributed self-diagnosis algorithm and its application to distributed computer networks is presented. The EVENT-SELF algorithm presented combines the rigor associated with theoretical results with the resource limitations associated with actual systems. Resource limitations identified in real systems include available message capacity for the communication network and limited processor execution speed. The EVENT-SELF algorithm differs from previously published algorithms by adopting an event-driven approach to self-diagnosability. Algorithm messages are reduced to those messages required to indicate changes in system state. Practical issues regarding the CMU-ECE DSD implementation are considered. These issues include the reconfiguration of the testing subnetwork for environments in which processors can be added and removed. One of the goals of this work is to utilize the developed CMU-ECE DSD system as an experimental test-bed environment for distributed applications.
"On the modelling and testing of recovery block structures," G. Pucci, in Digest of Papers, Fault-Tolerant Computing: 20th International Symposium (FTCS-20), 1990. doi: 10.1109/FTCS.1990.89389

The author proposes a reliability model for recovery block structures based on error events that can be observed and distinguished during testing. Strategies are described for the collection of failure histories, which are needed to estimate the model parameters and obtain dependability predictions. Given that the software goes through different testing stages, the model can be employed at different points of the development cycle to assess or forecast the quality of project choices and the resulting product.
"A novel concurrent error detection scheme for FFT networks," D. Tao, C. Hartmann, in Digest of Papers, Fault-Tolerant Computing: 20th International Symposium (FTCS-20), 1990. doi: 10.1109/FTCS.1990.89346

A novel algorithm-based fault tolerance scheme is proposed for fast Fourier transform (FFT) networks. It is shown that the proposed scheme achieves 100% fault coverage theoretically. An accurate measure of the fault coverage for FFT networks is provided by taking the roundoff error into account. It is shown that the proposed scheme maintains the low hardware overhead and high throughput of J.Y. Jou and J.A. Abraham's scheme and, at the same time, increases the fault coverage significantly.
"Hierarchical design and analysis of fault-tolerant multiprocessor systems using concurrent error detection," S. Nair, J. Abraham, in Digest of Papers, Fault-Tolerant Computing: 20th International Symposium (FTCS-20), 1990. doi: 10.1109/FTCS.1990.89348

A composition technique for building large fault-tolerant systems hierarchically using the concept of checks at different levels in the hierarchy is described. A small system of known fault detectability and locatability is replicated several times, and new checks are added at the next higher level. Such checks at different levels can be introduced into most of the existing multiprocessor systems. An analysis technique based on a matrix model is developed. Relationships between the fault detectability and locatability of a basic system are derived, and the corresponding values of the complete system are computed hierarchically. Finally, the techniques are extended to complex systems in which individual processors produce multiple sets of data elements.
"CATCH-compiler-assisted techniques for checkpointing," C. Li, W. Fuchs, in Digest of Papers, Fault-Tolerant Computing: 20th International Symposium (FTCS-20), 1990. doi: 10.1109/FTCS.1990.89337

A compiler-based approach to generating efficient checkpoints for process recovery is described. The presented approach to checkpointing is transparent to the programmer, the operating system, and the hardware. Compile-time information is exploited to maintain the desired checkpoint interval and to reduce the size of checkpoints. Compiler-generated sparse potential checkpoint code is used to maintain the desired checkpoint interval. Adaptive checkpointing has been developed to reduce the size of checkpoints by exploiting potentially large variations in memory usage. A training technique is used in selecting the low-cost, high-coverage potential checkpoints. Since the potential checkpoint selection problem is NP-complete, a heuristic algorithm has been developed to obtain a quick suboptimal solution. These compiler-assisted checkpointing techniques have been implemented in a modified version of the GNU C compiler (GCC), version 1.34. Experiments utilizing the CATCH GCC compiler on SUN workstations are described.