High-coverage fault tolerance in real-time systems based on point-to-point communication

K. Kim, C. Subbaraman, E. Shokri
Proceedings 1997 High-Assurance Engineering Workshop. Published 1997-08-11. DOI: 10.1109/HASE.1997.648053 (https://doi.org/10.1109/HASE.1997.648053)

Abstract

The distributed recovery block (DRB) scheme is a widely applicable approach to realizing both hardware and software fault tolerance in real-time distributed and parallel computer systems. One of the most important extensions of the DRB scheme, outlined in recent years but not yet fully developed, is its integration with a network surveillance (NS) scheme. We have developed an NS scheme, called the supervisor-based NS (SNS) scheme, that is effective in a variety of point-to-point networks. In this paper, we present an integration of the DRB scheme with the SNS scheme, called the DRB/SNS scheme. For systems based on point-to-point networks, this scheme significantly improves on previous versions of the DRB scheme in both fault coverage and recovery time bound. Execution support for the integrated scheme has been implemented as part of the DREAM kernel prototype, a timeliness-guaranteed operating system kernel developed at the University of California, Irvine. The recovery time bound of the DRB/SNS scheme is analyzed on the basis of the prototype implementation.
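
The abstract does not spell out the mechanism, so the following is only an illustrative sketch of the general recovery-block idea behind DRB as it is commonly described: a primary node and a shadow node each run a different try block on the same input, an acceptance test checks each result, and the shadow's result is used when the primary crashes or fails the test. It is not the DRB/SNS implementation and not the DREAM kernel API. All names here (`Node`, `drb_step`, `newton_sqrt`, `sqrt_acceptance_test`) are hypothetical, the two nodes run sequentially in one process rather than concurrently on separate machines, and a simple `crashed` flag stands in for whatever a network surveillance scheme would report.

```python
import math
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class Node:
    """Hypothetical stand-in for one node of a primary/shadow pair."""
    name: str
    try_block: Callable[[float], float]
    crashed: bool = False  # assumption: set by some external failure detector

    def run(self, x: float,
            acceptance_test: Callable[[float, float], bool]) -> Optional[float]:
        """Return the result if the node is alive and the result is accepted."""
        if self.crashed:
            return None
        try:
            result = self.try_block(x)
        except Exception:
            return None  # a raised error counts as a failed try block
        return result if acceptance_test(x, result) else None


def drb_step(primary: Node, shadow: Node, x: float,
             acceptance_test: Callable[[float, float], bool]) -> Tuple[float, str]:
    """Run both nodes on x and return (accepted result, name of producing node)."""
    p = primary.run(x, acceptance_test)
    s = shadow.run(x, acceptance_test)  # concurrent on another node in a real system
    if p is not None:
        return p, primary.name
    if s is not None:
        return s, shadow.name  # forward recovery: the shadow's result is used
    raise RuntimeError("both try blocks failed the acceptance test")


def newton_sqrt(x: float) -> float:
    """Alternate try block: square root by Newton iteration."""
    r = x if x > 0 else 1.0
    for _ in range(50):
        r = 0.5 * (r + x / r)
    return r


def sqrt_acceptance_test(x: float, r: float) -> bool:
    """Accept r if r*r is close enough to x."""
    return abs(r * r - x) <= 1e-6 * max(1.0, x)


if __name__ == "__main__":
    a = Node("A", math.sqrt)
    b = Node("B", newton_sqrt)
    print(drb_step(a, b, 2.0, sqrt_acceptance_test))
    a.crashed = True  # simulate a primary node failure
    print(drb_step(a, b, 2.0, sqrt_acceptance_test))
```

In this toy version the second call still produces an accepted result because the shadow has already computed its own answer, which is the property that lets a scheme of this kind bound recovery time; how DRB/SNS detects the failure and what bound it achieves is the subject of the paper itself.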