首页 > 最新文献

Proceedings 20th IEEE Symposium on Reliable Distributed Systems最新文献

英文 中文
Assessing inter-modular error propagation in distributed software 分布式软件中模块间错误传播的评估
Pub Date : 2001-10-28 DOI: 10.1109/RELDIS.2001.969769
A. Jhumka, M. Hiller, N. Suri
With the functionality of most embedded systems based on software (SW), interactions amongst SW modules arise, resulting in error propagation across them. During SW development, it would be helpful to have a framework that clearly demonstrates the error propagation and containment capabilities of the different SW components. In this paper, we assess the impact of inter-modular error propagation. Adopting a white-box SW approach, we make the following contributions: (a) we study and characterize the error propagation process and derive a set of metrics that quantitatively represents the inter-modular SW interactions, (b) we use a real embedded target system used in an aircraft arrestment system to perform fault-injection experiments to obtain experimental values for the metrics proposed, (c) we show how the set of metrics can be used to obtain the required analytical framework for error propagation analysis. We find that the derived analytical framework establishes a very close correlation between the analytical and experimental values obtained. The intent is to use this framework to be able to systematically develop SW such that inter-modular error propagation is reduced by design.
由于大多数嵌入式系统的功能都基于软件(SW),因此会出现软件模块之间的交互,从而导致错误在它们之间传播。在软件开发过程中,有一个框架可以清楚地展示不同软件组件的错误传播和遏制能力,这将是很有帮助的。在本文中,我们评估了模间误差传播的影响。采用白盒软件方法,我们做出了以下贡献:(a)我们研究并描述了错误传播过程,并推导出一组量化表示模块化间软件相互作用的指标,(b)我们使用飞机拦阻系统中使用的真实嵌入式目标系统进行故障注入实验,以获得所提出指标的实验值,(c)我们展示了如何使用这组指标来获得错误传播分析所需的分析框架。我们发现导出的分析框架在得到的分析值和实验值之间建立了非常密切的相关性。目的是使用这个框架来系统地开发软件,这样通过设计可以减少模块间的错误传播。
{"title":"Assessing inter-modular error propagation in distributed software","authors":"A. Jhumka, M. Hiller, N. Suri","doi":"10.1109/RELDIS.2001.969769","DOIUrl":"https://doi.org/10.1109/RELDIS.2001.969769","url":null,"abstract":"With the functionality of most embedded systems based on software (SW), interactions amongst SW modules arise, resulting in error propagation across them. During SW development, it would be helpful to have a framework that clearly demonstrates the error propagation and containment capabilities of the different SW components. In this paper, we assess the impact of inter-modular error propagation. Adopting a white-box SW approach, we make the following contributions: (a) we study and characterize the error propagation process and derive a set of metrics that quantitatively represents the inter-modular SW interactions, (b) we use a real embedded target system used in an aircraft arrestment system to perform fault-injection experiments to obtain experimental values for the metrics proposed, (c) we show how the set of metrics can be used to obtain the required analytical framework for error propagation analysis. We find that the derived analytical framework establishes a very close correlation between the analytical and experimental values obtained. The intent is to use this framework to be able to systematically develop SW such that inter-modular error propagation is reduced by design.","PeriodicalId":440881,"journal":{"name":"Proceedings 20th IEEE Symposium on Reliable Distributed Systems","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115459466","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 45
Application of commercial-grade digital equipment in nuclear power plant safety systems 商用级数字化设备在核电厂安全系统中的应用
Pub Date : 2001-10-28 DOI: 10.1109/RELDIS.2001.969772
M. Chiramal
Due to obsolescence, increasing maintenance costs, and the lack of qualified spare parts for the equipment and components of the analog instrumentation and control (I&C) systems in operating domestic nuclear power plants, nuclear utilities are replacing equipment and upgrading certain I&C systems. These activities generally involve changing from analog to digital technology. In many cases commercial products offer practical solutions. Digital I&C systems have the potential to enhance safety, reliability, and availability of the plant systems and improve plant operation. However, the use of digital software-based equipment presents challenges and concerns to the U.S. nuclear industry and the Nuclear Regulatory Commission (NRC). The NRC's approach to the review and acceptance of design qualification for digital systems largely focuses on confirming that the applicant or licensee has employed a high-quality development process that incorporated disciplined specification and implementation of design requirements. Inspection and testing is used to verify correct implementation and to validate the desired functionality of the final product.
由于国内核电站运行中的模拟仪表和控制(I&C)系统的设备和组件陈旧、维护成本增加以及缺乏合格的备件,核电公司正在更换设备和升级某些I&C系统。这些活动通常涉及从模拟技术到数字技术的转变。在许多情况下,商业产品提供了实用的解决方案。数字I&C系统有可能提高工厂系统的安全性、可靠性和可用性,并改善工厂的运营。然而,使用基于数字软件的设备给美国核工业和核管理委员会(NRC)带来了挑战和担忧。NRC审查和接受数字系统设计资格的方法主要集中在确认申请人或被许可人采用了高质量的开发过程,其中包括严格的规范和设计要求的实施。检查和测试用于验证正确的实现,并验证最终产品的期望功能。
{"title":"Application of commercial-grade digital equipment in nuclear power plant safety systems","authors":"M. Chiramal","doi":"10.1109/RELDIS.2001.969772","DOIUrl":"https://doi.org/10.1109/RELDIS.2001.969772","url":null,"abstract":"Due to obsolescence, increasing maintenance costs, and the lack of qualified spare parts for the equipment and components of the analog instrumentation and control (I&C) systems in operating domestic nuclear power plants, nuclear utilities are replacing equipment and upgrading certain I&C systems. These activities generally involve changing from analog to digital technology. In many cases commercial products offer practical solutions. Digital I&C systems have the potential to enhance safety, reliability, and availability of the plant systems and improve plant operation. However, the use of digital software-based equipment presents challenges and concerns to the U.S. nuclear industry and the Nuclear Regulatory Commission (NRC). The NRC's approach to the review and acceptance of design qualification for digital systems largely focuses on confirming that the applicant or licensee has employed a high-quality development process that incorporated disciplined specification and implementation of design requirements. Inspection and testing is used to verify correct implementation and to validate the desired functionality of the final product.","PeriodicalId":440881,"journal":{"name":"Proceedings 20th IEEE Symposium on Reliable Distributed Systems","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133515401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
The challenge of creating productive collaborating information assurance communities via Internet research and standards 通过Internet研究和标准创建高效协作信息保障社区的挑战
Pub Date : 2001-10-28 DOI: 10.1109/RELDIS.2001.969749
J. Betser
Overviews the challenging 5-year process leading to the design, specification, and implementation of the Internet, Engineering Task Force (IETF) Intrusion Detection Working Group (IDWQ) Intrusion Exchange Protocol (IDXP). IDXP seeks to facilitate the ubiquitous interoperability of intrusion detection components across Internet enterprises. This capability is a critical enabler of successful intrusion detection for large networks. The IETF IDWG was inspired by the DARPA CIDF activity. IDXP was developed and demonstrated in recent IETF meetings and in the IEEE DISCEX (DARPA Information Survivability Conference and EXposition). In the future, we intend to incorporate event correlation into IDXP. The process of achieving technical and organizational consensus among the segmented communities that comprise the information assurance community has been exceedingly challenging. The paper addresses the driving factors for this situation, and analyses the reasons for the ultimate community success in getting the process on the road. It is hoped that this experience would be useful in other technical disciplines facing large collaborative challenges within large secure distributed environments.
概述了Internet工程任务组(IETF)入侵检测工作组(IDWQ)入侵交换协议(IDXP)的设计、规范和实现的具有挑战性的5年过程。IDXP寻求促进跨Internet企业的入侵检测组件的普遍互操作性。此功能是大型网络成功进行入侵检测的关键因素。IETF IDWG受到DARPA CIDF活动的启发。IDXP在最近的IETF会议和IEEE DISCEX (DARPA信息生存能力会议和博览会)上得到了开发和演示。在未来,我们打算将事件关联合并到IDXP中。在组成信息保障社区的分段社区之间实现技术和组织共识的过程非常具有挑战性。本文阐述了造成这种情况的驱动因素,并分析了社区最终成功启动该过程的原因。希望这一经验对在大型安全分布式环境中面临大型协作挑战的其他技术学科有用。
{"title":"The challenge of creating productive collaborating information assurance communities via Internet research and standards","authors":"J. Betser","doi":"10.1109/RELDIS.2001.969749","DOIUrl":"https://doi.org/10.1109/RELDIS.2001.969749","url":null,"abstract":"Overviews the challenging 5-year process leading to the design, specification, and implementation of the Internet, Engineering Task Force (IETF) Intrusion Detection Working Group (IDWQ) Intrusion Exchange Protocol (IDXP). IDXP seeks to facilitate the ubiquitous interoperability of intrusion detection components across Internet enterprises. This capability is a critical enabler of successful intrusion detection for large networks. The IETF IDWG was inspired by the DARPA CIDF activity. IDXP was developed and demonstrated in recent IETF meetings and in the IEEE DISCEX (DARPA Information Survivability Conference and EXposition). In the future, we intend to incorporate event correlation into IDXP. The process of achieving technical and organizational consensus among the segmented communities that comprise the information assurance community has been exceedingly challenging. The paper addresses the driving factors for this situation, and analyses the reasons for the ultimate community success in getting the process on the road. It is hoped that this experience would be useful in other technical disciplines facing large collaborative challenges within large secure distributed environments.","PeriodicalId":440881,"journal":{"name":"Proceedings 20th IEEE Symposium on Reliable Distributed Systems","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132168534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Consensus with written messages under link faults 在链路故障下与书面消息达成一致
Pub Date : 2001-10-28 DOI: 10.1109/RELDIS.2001.970768
Bettina Weiss, U. Schmid
This paper shows that deterministic consensus with written messages is possible in presence of link faults and compromised signatures. Relying upon a suitable perception-based hybrid fault model that provides different categories for both node and link faults, we prove that the authenticated Byzantine agreement algorithms OMHA and ZA of Gong, Lincoln and Rushby (1995) can be made resilient to f/sub l/ link faults per node by adding 3f/sub l/ and 2f/sub l/ nodes, respectively. Both algorithms can also cope with compromised signatures if the affected nodes are considered as arbitrary faulty. Authenticated algorithms for consensus are therefore reasonably applicable even in wireless systems, where link faults and intrusions are the dominating source of errors.
本文证明了在存在链路故障和妥协签名的情况下,与书面消息的确定性一致是可能的。基于一个合适的基于感知的混合故障模型,该模型为节点和链路故障提供了不同的类别,我们证明了Gong, Lincoln和Rushby(1995)的认证拜占庭协议算法OMHA和ZA通过分别添加3f/sub /和2f/sub /节点,可以使每个节点对f/sub / l/链路故障具有弹性。如果受影响的节点被认为是任意故障,这两种算法还可以处理受损的签名。因此,即使在链路故障和入侵是主要错误来源的无线系统中,协商一致的认证算法也合理地适用。
{"title":"Consensus with written messages under link faults","authors":"Bettina Weiss, U. Schmid","doi":"10.1109/RELDIS.2001.970768","DOIUrl":"https://doi.org/10.1109/RELDIS.2001.970768","url":null,"abstract":"This paper shows that deterministic consensus with written messages is possible in presence of link faults and compromised signatures. Relying upon a suitable perception-based hybrid fault model that provides different categories for both node and link faults, we prove that the authenticated Byzantine agreement algorithms OMHA and ZA of Gong, Lincoln and Rushby (1995) can be made resilient to f/sub l/ link faults per node by adding 3f/sub l/ and 2f/sub l/ nodes, respectively. Both algorithms can also cope with compromised signatures if the affected nodes are considered as arbitrary faulty. Authenticated algorithms for consensus are therefore reasonably applicable even in wireless systems, where link faults and intrusions are the dominating source of errors.","PeriodicalId":440881,"journal":{"name":"Proceedings 20th IEEE Symposium on Reliable Distributed Systems","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116899198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Polynomial time synthesis of Byzantine agreement 拜占庭协议的多项式时间合成
Pub Date : 2001-10-28 DOI: 10.1109/RELDIS.2001.969767
S. Kulkarni, A. Arora, Arun Chippada
We present a polynomial time algorithm for automatic synthesis of fault-tolerant distributed programs, starting from fault-intolerant versions of those programs. Since this synthesis problem is known to be NP-hard, our algorithm relies on heuristics to reduce the complexity. We demonstrate that our algorithm is able to synthesize an agreement program that tolerates a Byzantine fault.
我们提出了一种多项式时间算法,用于自动合成容错分布式程序,从这些程序的容错版本开始。由于已知这个综合问题是np困难的,我们的算法依赖于启发式来降低复杂性。我们证明了我们的算法能够合成一个容忍拜占庭错误的协议程序。
{"title":"Polynomial time synthesis of Byzantine agreement","authors":"S. Kulkarni, A. Arora, Arun Chippada","doi":"10.1109/RELDIS.2001.969767","DOIUrl":"https://doi.org/10.1109/RELDIS.2001.969767","url":null,"abstract":"We present a polynomial time algorithm for automatic synthesis of fault-tolerant distributed programs, starting from fault-intolerant versions of those programs. Since this synthesis problem is known to be NP-hard, our algorithm relies on heuristics to reduce the complexity. We demonstrate that our algorithm is able to synthesize an agreement program that tolerates a Byzantine fault.","PeriodicalId":440881,"journal":{"name":"Proceedings 20th IEEE Symposium on Reliable Distributed Systems","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127132149","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Applying fault-tolerance principles to security research 将容错原则应用于安全性研究
Pub Date : 2001-10-28 DOI: 10.1109/RELDIS.2001.969746
A. Bhargava, B. Bhargava
There has been much focus on building secure distributed systems. The CERIAS center has been established at Purdue along with 14 other such centers in USA. We note that many of the ideas, concepts, algorithms being proposed in security have many common threads with reliability. We need to apply the science and engineering of reliability research to the research in security and vice versa. We briefly give some examples to illustrate the ideas. To increase reliability in distributed systems, the use of quorums allows the transactions to read and write replicas even if some replicas have failed or are unavailable. So the systems manage the replicas so that a forum can be formed in the presence of failures. To make systems secure against unauthorized access, one can use the reverse strategy of making it difficult to form quorums. All accesses require permission from a group of authorities who could coordinate to deny a yes majority vote.
构建安全的分布式系统一直备受关注。CERIAS中心已在普渡大学与美国其他14个此类中心一起建立。我们注意到,安全性中提出的许多思想、概念和算法都有许多与可靠性有关的共同线索。我们需要将可靠性研究的科学和工程应用到安全性研究中,反之亦然。我们简单地给出一些例子来说明这些思想。为了提高分布式系统的可靠性,仲裁的使用允许事务读取和写入副本,即使某些副本已经失败或不可用。因此,系统管理副本,以便在出现故障时形成论坛。为了使系统免受未经授权的访问,可以使用相反的策略,使其难以形成quorum。所有访问都需要获得一组权威机构的许可,这些权威机构可以协调一致,拒绝“赞成”多数投票。
{"title":"Applying fault-tolerance principles to security research","authors":"A. Bhargava, B. Bhargava","doi":"10.1109/RELDIS.2001.969746","DOIUrl":"https://doi.org/10.1109/RELDIS.2001.969746","url":null,"abstract":"There has been much focus on building secure distributed systems. The CERIAS center has been established at Purdue along with 14 other such centers in USA. We note that many of the ideas, concepts, algorithms being proposed in security have many common threads with reliability. We need to apply the science and engineering of reliability research to the research in security and vice versa. We briefly give some examples to illustrate the ideas. To increase reliability in distributed systems, the use of quorums allows the transactions to read and write replicas even if some replicas have failed or are unavailable. So the systems manage the replicas so that a forum can be formed in the presence of failures. To make systems secure against unauthorized access, one can use the reverse strategy of making it difficult to form quorums. All accesses require permission from a group of authorities who could coordinate to deny a yes majority vote.","PeriodicalId":440881,"journal":{"name":"Proceedings 20th IEEE Symposium on Reliable Distributed Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130340071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A microkernel middleware architecture for distributed embedded real-time systems 分布式嵌入式实时系统的微内核中间件体系结构
Pub Date : 2001-10-28 DOI: 10.1109/RELDIS.2001.970772
U. Brinkschulte, A. Bechina, F. Picioroaga, E. Schneider, T. Ungerer, J. Kreuzinger, M. Pfeffer
Today more and more embedded real-time systems are implemented in a distributed way. These distributed embedded systems consist of a few controllers up to several hundreds. Distribution and parallelism in the design of embedded real-time systems increase the engineering challenges and require new methodological framework based on middleware. Our research work focuses on the development of a middleware that supports the design of heterogeneous distributed real-time systems and allows the use of small microcontrollers as computation nodes. Our study is aimed to a new approach that led to the development of OSA+-a scalable service-oriented real-time middleware architecture. This middleware has been used as the basic platform for different domain applications: (i) conception of an autonomous guided vehicle system based on multithreaded Java microcontrollers and (ii) development of a permanent monitoring distributed system for an oil drilling application. This paper presents the basic architecture of OSA+ and its implementation for the distributed real-time embedded systems design.
目前,越来越多的嵌入式实时系统采用分布式方式实现。这些分布式嵌入式系统由几个控制器到几百个控制器组成。嵌入式实时系统设计中的分布式和并行性增加了工程挑战,需要基于中间件的新的方法框架。我们的研究工作集中于中间件的开发,该中间件支持异构分布式实时系统的设计,并允许使用小型微控制器作为计算节点。我们的研究旨在找到一种新的方法来开发OSA+——一种可扩展的面向服务的实时中间件体系结构。该中间件已被用作不同领域应用的基本平台:(i)基于多线程Java微控制器的自主引导车辆系统的概念;(ii)为石油钻井应用开发永久监控分布式系统。本文介绍了OSA+的基本体系结构及其在分布式实时嵌入式系统设计中的实现。
{"title":"A microkernel middleware architecture for distributed embedded real-time systems","authors":"U. Brinkschulte, A. Bechina, F. Picioroaga, E. Schneider, T. Ungerer, J. Kreuzinger, M. Pfeffer","doi":"10.1109/RELDIS.2001.970772","DOIUrl":"https://doi.org/10.1109/RELDIS.2001.970772","url":null,"abstract":"Today more and more embedded real-time systems are implemented in a distributed way. These distributed embedded systems consist of a few controllers up to several hundreds. Distribution and parallelism in the design of embedded real-time systems increase the engineering challenges and require new methodological framework based on middleware. Our research work focuses on the development of a middleware that supports the design of heterogeneous distributed real-time systems and allows the use of small microcontrollers as computation nodes. Our study is aimed to a new approach that led to the development of OSA+-a scalable service-oriented real-time middleware architecture. This middleware has been used as the basic platform for different domain applications: (i) conception of an autonomous guided vehicle system based on multithreaded Java microcontrollers and (ii) development of a permanent monitoring distributed system for an oil drilling application. This paper presents the basic architecture of OSA+ and its implementation for the distributed real-time embedded systems design.","PeriodicalId":440881,"journal":{"name":"Proceedings 20th IEEE Symposium on Reliable Distributed Systems","volume":"92 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114562627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 19
Using the timely computing base for dependable QoS adaptation 利用及时的计算基础进行可靠的QoS适配
Pub Date : 2001-10-28 DOI: 10.1109/RELDIS.2001.970771
A. Casimiro, P. Veríssimo
In open and heterogeneous environments, where an unpredictable number of applications compete for a limited amount of resources, executions can be affected by also unpredictable delays, which may not even be bounded. Since many of these applications have timeliness requirements, they can only be implemented if they are able to adapt to the existing conditions. We present a novel approach, called dependable QoS adaptation, which can only be achieved if the environment is accurately and reliably observed. Dependable QoS adaptation is based on the timely computing base (TCB) model. The TCB model is a partial quality of service synchrony model that adequately characterizes environments of uncertain synchrony and allows, at the same time, the specification and verification of timeliness requirements. We introduce the coverage stability property and show that adaptive applications can use the TCB to dependably adapt and enjoy this property. We describe the characteristics and the interface of a QoS coverage service and discuss its implementation details.
在开放和异构环境中,数量不可预测的应用程序争夺数量有限的资源,执行可能受到同样不可预测的延迟的影响,甚至可能没有限制。由于这些应用程序中的许多都具有及时性要求,因此只有在能够适应现有条件的情况下才能实现它们。我们提出了一种新的方法,称为可靠的QoS适应,它只能在环境被准确和可靠地观察到的情况下实现。可靠的QoS自适应基于及时计算基(TCB)模型。TCB模型是部分服务质量同步模型,它充分表征了不确定同步的环境,同时允许对及时性需求进行规范和验证。我们介绍了覆盖稳定性,并证明自适应应用可以使用TCB可靠地适应并享受这一特性。我们描述了QoS覆盖服务的特征和接口,并讨论了其实现细节。
{"title":"Using the timely computing base for dependable QoS adaptation","authors":"A. Casimiro, P. Veríssimo","doi":"10.1109/RELDIS.2001.970771","DOIUrl":"https://doi.org/10.1109/RELDIS.2001.970771","url":null,"abstract":"In open and heterogeneous environments, where an unpredictable number of applications compete for a limited amount of resources, executions can be affected by also unpredictable delays, which may not even be bounded. Since many of these applications have timeliness requirements, they can only be implemented if they are able to adapt to the existing conditions. We present a novel approach, called dependable QoS adaptation, which can only be achieved if the environment is accurately and reliably observed. Dependable QoS adaptation is based on the timely computing base (TCB) model. The TCB model is a partial quality of service synchrony model that adequately characterizes environments of uncertain synchrony and allows, at the same time, the specification and verification of timeliness requirements. We introduce the coverage stability property and show that adaptive applications can use the TCB to dependably adapt and enjoy this property. We describe the characteristics and the interface of a QoS coverage service and discuss its implementation details.","PeriodicalId":440881,"journal":{"name":"Proceedings 20th IEEE Symposium on Reliable Distributed Systems","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123344542","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 61
Designing a robust namespace for distributed file services 为分布式文件服务设计健壮的名称空间
Pub Date : 2001-10-28 DOI: 10.1109/RELDIS.2001.969770
Zheng Zhang, C. Karamanolis
A number of ongoing research projects follow a partition-based approach to provide highly scalable distributed storage services. These systems maintain namespaces that reference objects distributed across multiple locations in the system. Typically, atomic commitment protocols, such as 2-phase commit, are used for updating the namespace, in order to guarantee its consistency even in the presence of failures. Atomic commitment protocols are known to impose a high overhead to failure-free execution. Furthermore, they use conservative recovery procedures and may considerably restrict the concurrency of overlapping operations in the system. This paper proposes a set of new protocols implementing the fundamental operations in a distributed namespace. The protocols impose a minimal overhead to failure-free execution. They are robust against both communication and host failures, and use aggressive recovery procedures to re-execute incomplete operations. The proposed protocols are compared with their 2-phase commit counterparts and are shown to outperform them in all critical performance factors: communication round-trips, synchronous I/O, operation concurrency.
许多正在进行的研究项目都遵循基于分区的方法来提供高度可伸缩的分布式存储服务。这些系统维护的名称空间引用了分布在系统中多个位置的对象。通常,原子提交协议(例如两阶段提交)用于更新名称空间,以便即使在出现故障时也能保证其一致性。众所周知,原子提交协议会给无故障执行带来很高的开销。此外,它们使用保守的恢复过程,可能会大大限制系统中重叠操作的并发性。本文提出了一套新的协议来实现分布式命名空间中的基本操作。协议对无故障执行的开销最小。它们对通信和主机故障都很健壮,并且使用积极的恢复过程来重新执行未完成的操作。将所提出的协议与两阶段提交的协议进行比较,结果显示在所有关键性能因素上都优于它们:通信往返、同步I/O、操作并发性。
{"title":"Designing a robust namespace for distributed file services","authors":"Zheng Zhang, C. Karamanolis","doi":"10.1109/RELDIS.2001.969770","DOIUrl":"https://doi.org/10.1109/RELDIS.2001.969770","url":null,"abstract":"A number of ongoing research projects follow a partition-based approach to provide highly scalable distributed storage services. These systems maintain namespaces that reference objects distributed across multiple locations in the system. Typically, atomic commitment protocols, such as 2-phase commit, are used for updating the namespace, in order to guarantee its consistency even in the presence of failures. Atomic commitment protocols are known to impose a high overhead to failure-free execution. Furthermore, they use conservative recovery procedures and may considerably restrict the concurrency of overlapping operations in the system. This paper proposes a set of new protocols implementing the fundamental operations in a distributed namespace. The protocols impose a minimal overhead to failure-free execution. They are robust against both communication and host failures, and use aggressive recovery procedures to re-execute incomplete operations. The proposed protocols are compared with their 2-phase commit counterparts and are shown to outperform them in all critical performance factors: communication round-trips, synchronous I/O, operation concurrency.","PeriodicalId":440881,"journal":{"name":"Proceedings 20th IEEE Symposium on Reliable Distributed Systems","volume":"1013 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123115010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Performance analysis of the CORBA notification service CORBA通知服务的性能分析
Pub Date : 2001-10-28 DOI: 10.1109/RELDIS.2001.970773
S. Ramani, Kishor S. Trivedi, B. Dasarathy
As CORBA (Common Object Request Broker Architecture) gains popularity as a standard for portable, distributed, object-oriented computing, the need for a CORBA messaging solution is being increasingly felt. This led the Object Management Group (OMQ) to specify a Notification Service that aims to provide a more flexible and robust messaging solution than the earlier Event Service. The Notification Service provides several configurable quality of service (QoS) and administrative settings that deal with issues such as reliability, event (message) delivery order and discard policies. Unlike in conventional queuing systems, some Notification Service QoS configurations can lead to discards from within the internal queues, requiring careful analysis and configuration if such discards are to be avoided or minimized. This paper presents stochastic models (based on continuous time Markov chains and queuing theory) to analyze the Notification Service delivery and discard policies in detail.
随着CORBA(公共对象请求代理体系结构)作为可移植的、分布式的、面向对象的计算标准越来越受欢迎,人们越来越感觉到对CORBA消息传递解决方案的需求。这使得对象管理组(OMQ)指定了一个通知服务,该服务旨在提供比早期的事件服务更灵活、更健壮的消息传递解决方案。Notification Service提供了几个可配置的服务质量(QoS)和管理设置,用于处理诸如可靠性、事件(消息)传递顺序和丢弃策略等问题。与传统队列系统不同,某些Notification Service QoS配置可能导致内部队列中的丢弃,如果要避免或最小化此类丢弃,则需要仔细分析和配置。本文提出了基于连续时间马尔可夫链和排队论的随机模型来详细分析通知服务的交付和丢弃策略。
{"title":"Performance analysis of the CORBA notification service","authors":"S. Ramani, Kishor S. Trivedi, B. Dasarathy","doi":"10.1109/RELDIS.2001.970773","DOIUrl":"https://doi.org/10.1109/RELDIS.2001.970773","url":null,"abstract":"As CORBA (Common Object Request Broker Architecture) gains popularity as a standard for portable, distributed, object-oriented computing, the need for a CORBA messaging solution is being increasingly felt. This led the Object Management Group (OMQ) to specify a Notification Service that aims to provide a more flexible and robust messaging solution than the earlier Event Service. The Notification Service provides several configurable quality of service (QoS) and administrative settings that deal with issues such as reliability, event (message) delivery order and discard policies. Unlike in conventional queuing systems, some Notification Service QoS configurations can lead to discards from within the internal queues, requiring careful analysis and configuration if such discards are to be avoided or minimized. This paper presents stochastic models (based on continuous time Markov chains and queuing theory) to analyze the Notification Service delivery and discard policies in detail.","PeriodicalId":440881,"journal":{"name":"Proceedings 20th IEEE Symposium on Reliable Distributed Systems","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125157452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Proceedings 20th IEEE Symposium on Reliable Distributed Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1