2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06): Latest Papers

Performance evaluation of a fair fault-tolerant mutual exclusion algorithm

2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06)

Pub Date : 2006-10-02 DOI: 10.1109/SRDS.2006.35

Julien Sopena, L. Arantes, Pierre Sens

This paper presents an efficient and fair fault-tolerant token-based algorithm for achieving mutual exclusion. It extends the Naimi-Trehel algorithm, which uses a distributed queue of token requests and a dynamic tree. When failures occur, our algorithm tries to recover the request queue by gathering the intact portions of the queue that existed just before the failure. Thus, fairness of token requests is preserved despite failures. Furthermore, the use of broadcast is minimized when rebuilding the dynamic tree. Experimental results with different fault injection scenarios show that our approach achieves fast failure recovery and low message broadcast overhead.
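
The distributed request queue and dynamic tree mentioned above can be illustrated with a minimal sketch of the base Naimi-Trehel algorithm as it is commonly described. This is not the paper's fault-tolerant extension. The class names, the field names (`last`, `next`), and the single-process simulation with FIFO message delivery are assumptions made for illustration.

```python
from collections import deque


class Node:
    """Per-node state of the base algorithm (field names are illustrative)."""

    def __init__(self, ident, root):
        self.ident = ident
        # Probable token owner; None means this node is the root of the tree.
        self.last = None if ident == root else root
        self.next = None  # successor in the distributed queue of requests
        self.has_token = ident == root
        self.requesting = False


class Simulation:
    """Single-process simulation with FIFO message delivery (an assumption)."""

    def __init__(self, n, root=0):
        self.nodes = [Node(i, root) for i in range(n)]
        self.messages = deque()  # (kind, destination, requester)
        self.entry_order = []    # nodes in the order they enter the critical section

    def request(self, i):
        # Assumes a node does not request again before it has released.
        node = self.nodes[i]
        node.requesting = True
        if node.last is None:
            # An idle root already holds the token and enters immediately.
            self.entry_order.append(i)
        else:
            # Ask the probable owner, then become the new root of the tree.
            self.messages.append(("request", node.last, i))
            node.last = None

    def release(self, i):
        node = self.nodes[i]
        node.requesting = False
        if node.next is not None:
            # Hand the token to the next requester in the queue.
            node.has_token = False
            self.messages.append(("token", node.next, None))
            node.next = None

    def run(self):
        """Deliver messages until none are in flight."""
        while self.messages:
            kind, dest, requester = self.messages.popleft()
            node = self.nodes[dest]
            if kind == "token":
                node.has_token = True
                self.entry_order.append(dest)
            elif node.last is None:
                if node.requesting:
                    node.next = requester  # queue the requester behind this node
                else:
                    node.has_token = False  # idle root: pass the token directly
                    self.messages.append(("token", requester, None))
                node.last = requester
            else:
                # Not the root: forward the request toward the probable owner.
                self.messages.append(("request", node.last, requester))
                node.last = requester
```

In this sketch, the `next` fields of the waiting nodes form the queue that a fault-tolerant variant would have to reconstruct after a crash, and the `last` fields form the dynamic tree.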

Citations: 5

Topology Sensitive Replica Selection

2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06)

Pub Date : 2006-10-02 DOI: 10.1109/SRDS.2006.46

D. Brodsky, M. Feeley, N. Hutchinson

As the disks typically found in personal computers grow larger, protecting data by replicating it on a collection of "peer" systems rather than on dedicated high-performance storage systems can provide comparable reliability and availability guarantees at reduced cost and complexity. To be adopted, peer-to-peer storage systems must be able to replicate data on hosts that are trusted, secure, and available. However, recent research has shown that the traditional model, where nodes are assumed to have identical levels of trust, to behave independently, and to have similar failure modes, is oversimplified. Thus, there is a need for a mechanism that automatically and efficiently selects replica nodes from a large number of available hosts with varying capabilities and trust levels. In this paper we present an algorithm to handle replica node selection, either for new replica groups or to replace failed replicas, in a peer-to-peer storage system. We show through simulation that our algorithm maintains the node interconnection topology, minimizing the cost of recovery from a failed replica, measured by the number of nodes affected by the failure and the number of inter-node messages.

Citations: 3

Fault-tolerant and scalable TCP splice and web server architecture

2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06)

Pub Date : 2006-10-02 DOI: 10.1109/SRDS.2006.21

M. Marwah, Shivakant Mishra, C. Fetzer

This paper describes three enhancements to the TCP splicing mechanism: (1) enabling a TCP connection to be simultaneously spliced through multiple machines for higher scalability; (2) making a spliced connection fault-tolerant to proxy failures; and (3) providing the flexibility of splitting a TCP splice between a proxy and a backend server to further increase the scalability of a Web server system. A Web server architecture based on this enhanced TCP splicing is proposed. This architecture provides a highly scalable, seamless service to users with minimal disruption during server failures. In addition to the traditional Web services in which users download Web pages, multimedia files and other types of data from a Web server, the proposed architecture supports newly emerging Web services that are highly interactive and involve relatively longer, stateful client-server sessions. A prototype of this architecture has been implemented as a Linux 2.6 kernel module, and the paper presents important performance results measured from this implementation.

Citations: 23

Improving DBMS Performance through Diverse Redundancy

2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06)

Pub Date : 2006-10-02 DOI: 10.1109/SRDS.2006.27

Vladimir Stankovic, P. Popov

Database replication is widely used to improve both fault tolerance and DBMS performance. Non-diverse database replication has a significant limitation: it is effective against crash failures only. Diverse redundancy is an effective mechanism for tolerating a wider range of failures, including many non-crash failures. However, it has not been adopted in practice because many see DBMS performance as the main concern. In this paper we show experimental evidence that diverse redundancy (diverse replication) can bring benefits in terms of DBMS performance, too. We report on experimental results with an optimistic architecture built with two diverse DBMSs under a load derived from the TPC-C benchmark, which show that a diverse pair performs faster not only than non-diverse pairs but also than the individual copies of the DBMSs used. This result is important because it shows potential for DBMS performance better than anything achievable with the available off-the-shelf servers.

Citations: 6

Hidden Markov Models as a Support for Diagnosis: Formalization of the Problem and Synthesis of the Solution

2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06)

Pub Date : 2006-10-02 DOI: 10.1109/SRDS.2006.24

Alessandro Daidone, F. Giandomenico, A. Bondavalli, S. Chiaradonna

In modern information infrastructures, diagnosis must be able to assess the status or the extent of the damage of individual components. Traditional one-shot diagnosis is not adequate; instead, streams of data on component behavior need to be collected and filtered over time, as done by some existing heuristics. This paper proposes a general framework and a formalism to model such over-time diagnosis scenarios and to find appropriate solutions. As such, it helps system designers support their design choices. Taking advantage of the characteristics of the hidden Markov model formalism, widely used in pattern recognition, the paper proposes a formalization of the diagnosis process, addressing the complete chain constituted by monitored component, deviation detection and state diagnosis. Hidden Markov models are well suited to represent problems where the internal state of a certain entity is not known and can only be inferred from external observations of what this entity emits. Such over-time diagnosis is a first-class representative of this category of problems. The accuracy of diagnosis carried out through the proposed formalization is then discussed, as well as how to concretely use it to perform state diagnosis and allow direct comparison of alternative solutions.
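
As an illustration of over-time diagnosis with a hidden Markov model, the following minimal sketch filters a stream of deviation-detector outputs into a belief about the hidden component state, using the standard forward algorithm. The two states, the observation symbols, the threshold, and all probabilities are hypothetical values chosen for illustration; none of them is taken from the paper.

```python
# Hypothetical two-state HMM: the component is either "healthy" or "faulty",
# and a deviation detector emits "ok" or "alarm" at every step.
STATES = ("healthy", "faulty")

# Assumed model parameters (illustrative only, not from the paper).
INITIAL = {"healthy": 0.95, "faulty": 0.05}
TRANSITION = {
    "healthy": {"healthy": 0.98, "faulty": 0.02},
    "faulty": {"healthy": 0.00, "faulty": 1.00},  # faults are permanent here
}
EMISSION = {
    "healthy": {"ok": 0.90, "alarm": 0.10},  # 10% false alarms
    "faulty": {"ok": 0.30, "alarm": 0.70},   # 30% missed detections
}


def filter_beliefs(observations):
    """Return P(state | observations so far) after each observation."""
    beliefs = []
    prior = dict(INITIAL)
    for obs in observations:
        # Weight the predicted distribution by the likelihood of the observation.
        unnormalized = {s: prior[s] * EMISSION[s][obs] for s in STATES}
        total = sum(unnormalized.values())
        posterior = {s: unnormalized[s] / total for s in STATES}
        beliefs.append(posterior)
        # Predict the state distribution for the next step.
        prior = {
            nxt: sum(posterior[cur] * TRANSITION[cur][nxt] for cur in STATES)
            for nxt in STATES
        }
    return beliefs


def diagnose(observations, threshold=0.5):
    """Report "faulty" once the filtered fault probability exceeds the threshold."""
    final = filter_beliefs(observations)[-1]
    return "faulty" if final["faulty"] > threshold else "healthy"
```

With these assumed parameters, a single alarm leaves the component most likely healthy, while repeated alarms push the belief past the threshold. This is the kind of accumulation of evidence over time that one-shot diagnosis lacks.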

Citations: 51

SegmentShield: Exploiting Segmentation Hardware for Protecting against Buffer Overflow Attacks

2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06)

Pub Date : 2006-10-02 DOI: 10.1109/SRDS.2006.43

Takahiro Shinagawa

This paper presents a strong and efficient scheme for protecting against buffer overflow attacks. The basic approach of this scheme is pointer copying: copies of code pointers are stored in a safe memory area to detect and prevent the manipulation of code pointers. To protect the copied code pointers from data-pointer modification attacks, the scheme exploits the segmentation hardware of IA-32 (Intel x86) processors. It provides protection as strong as write-protecting the memory area via system calls. At the same time, it incurs only a modest overhead, because copying a code pointer requires only a few user-level instructions and there is no penalty for entering the kernel. The experimental results show that the performance overhead in OpenSSL ranges from 0.9% to 4.3%.
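
The pointer-copying idea can be sketched as a toy model. This is a simulation only: Python cannot express segmentation hardware, so the "safe area" here is just a separate dictionary that the simulated overflow never touches. The class and method names are hypothetical and do not come from the paper.

```python
class CodePointerTampered(Exception):
    """Raised when a code pointer no longer matches its protected copy."""


class PointerCopyGuard:
    """Toy model of pointer copying.

    `memory` stands in for ordinary writable memory that an overflow can
    corrupt. `_safe_copies` stands in for the protected area. In this sketch
    the protection holds only by convention (nothing but `save` writes to
    `_safe_copies`), whereas the paper relies on segmentation hardware.
    """

    def __init__(self, size):
        self.memory = [0] * size
        self._safe_copies = {}

    def save(self, address, code_pointer):
        """Store a code pointer in memory and keep a copy in the safe area."""
        self.memory[address] = code_pointer
        self._safe_copies[address] = code_pointer

    def unchecked_write(self, start, values):
        """Model an unchecked buffer write that may run past the buffer."""
        for offset, value in enumerate(values):
            self.memory[start + offset] = value

    def load(self, address):
        """Return the code pointer, refusing to use it if it was modified."""
        if self.memory[address] != self._safe_copies[address]:
            raise CodePointerTampered(f"code pointer at {address} was modified")
        return self.memory[address]
```

For example, with a four-slot buffer at addresses 0 to 3 and a code pointer saved at address 4, writing four values leaves `load(4)` unaffected, while writing five values overwrites the pointer and makes `load(4)` raise `CodePointerTampered`.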

Citations: 2

An SNMP based failure detection service

2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06)

Pub Date : 2006-02-01 DOI: 10.1109/SRDS.2006.9

M. Wiesmann, P. Urbán, X. Défago

In this paper, we present the SNMP-FD service, a novel failure detection service entirely based on the Simple Network Management Protocol (SNMP). This approach promises better interoperability with external tools and failure information sources, including network equipment and cluster management tools. We first show how the SNMP standard can be used to build a failure detection service. We describe the already standardized interfaces that can be reused and introduce the interfaces that need to be added. SNMP is used extensively in the service for messaging, process status description, configuration, service statistics, and delivering failure detection information to applications. We then present our implementation and an evaluation of its performance and quality of service.

Citations: 18
