Proceedings. 14th Symposium on Reliable Distributed Systems最新文献

英文中文

The performance of consistent checkpointing in distributed shared memory systems 分布式共享内存系统中一致性检查点的性能

Proceedings. 14th Symposium on Reliable Distributed Systems

Pub Date : 1995-09-13 DOI: 10.1109/RELDIS.1995.526217

G. Cabillic, Gilles Muller, I. Puaut

This paper presents the design and implementation of a consistent checkpointing scheme for distributed shared memory (DSM) systems. Our approach relies on the integration of checkpoints within synchronization barriers already existing in applications; this avoids the need to introduce an additional synchronization mechanism. The main advantage of our checkpointing mechanism is that performance degradation arises only when a checkpoint is being taken; hence, the programmer can adjust the trade-off between the cost of checkpointing and the cost of longer rollbacks by adjusting the time between two successive checkpoints. The paper compares several implementations of the proposed consistent checkpointing mechanism (incremental, non-blocking, and pre-flushing) on the Intel Paragon multicomputer for several parallel scientific applications. Performance measures show that a careful optimization of the checkpointing protocol can reduce the time overhead of checkpointing from 8% to 0.04% of the application duration for a 6 mn checkpointing interval.

本文提出了一种分布式共享内存(DSM)系统的一致性检查点方案的设计与实现。我们的方法依赖于在应用程序中已经存在的同步屏障内集成检查点;这避免了引入额外同步机制的需要。我们的检查点机制的主要优点是，只有当检查点被占用时，性能才会下降;因此，程序员可以通过调整两个连续检查点之间的时间来调整检查点成本和更长的回滚成本之间的权衡。本文比较了在Intel Paragon多计算机上针对多个并行科学应用所提出的一致性检查点机制(增量、非阻塞和预刷新)的几种实现。性能度量表明，仔细优化检查点协议可以将检查点的时间开销从应用程序持续时间的8%减少到0.04%，检查点间隔为6 mn。

引用次数: 58

A correctness criterion for advanced transaction models 高级事务模型的正确性标准

Proceedings. 14th Symposium on Reliable Distributed Systems

Pub Date : 1995-09-13 DOI: 10.1109/RELDIS.1995.518720

A. Rakotonirainy

The transaction concept was originally applied to database applications. Serializability theory captured transaction correctness and database objects consistency properties in a single notion. Today, increasingly sophisticated information requires new correctness criteria due to the limitation of classical serialisability theory which allows only a limited cooperation between its components. Several models relaxing the ACID (Atomicity, Consistency, Isolation, Durability) properties in a controlled manner have been developed. These approaches exploit separately the semantics properties of operations (object semantic approach) and application semantics (transaction interleaving approach). The notion of correctness can be refined with the help of the two previous approaches whilst increasing concurrency. In this paper, we will the gap between transaction and object semantic correctness criteria. We define a new class of schedule called Multilevel Relative Serialisability (MLRS) to combine the two approaches. This class of schedule preserve correctness properties defined in terms of object and transaction semantics. We use ACTA formalism to express object consistency, transaction correctness and MLRS. This work merges existing /spl Lt/relaxed/spl Gt/ transaction models into a unified concept. This concept is useful for long-lived, cooperative and hierarchical transaction models.

事务概念最初应用于数据库应用程序。可序列化性理论在单个概念中捕获事务正确性和数据库对象一致性属性。今天，由于经典的可串行性理论的限制，越来越复杂的信息需要新的正确性标准，它只允许组件之间有限的合作。已经开发了几种以受控方式放松ACID(原子性、一致性、隔离性、持久性)特性的模型。这些方法分别利用操作的语义属性(对象语义方法)和应用程序语义(事务交错方法)。在前两种方法的帮助下，正确性的概念可以得到改进，同时增加并发性。在本文中，我们将讨论事务和对象语义正确性标准之间的差距。我们定义了一个新的调度类，称为多级相对序列化(MLRS)，以结合这两种方法。这类调度保留了根据对象和事务语义定义的正确性属性。我们使用ACTA形式化来表达对象一致性、事务正确性和MLRS。这项工作将现有的/spl Lt/放宽的/spl Gt/事务模型合并为一个统一的概念。这个概念对于长期的、协作的和分层的事务模型非常有用。

{"title":"A correctness criterion for advanced transaction models","authors":"A. Rakotonirainy","doi":"10.1109/RELDIS.1995.518720","DOIUrl":"https://doi.org/10.1109/RELDIS.1995.518720","url":null,"abstract":"The transaction concept was originally applied to database applications. Serializability theory captured transaction correctness and database objects consistency properties in a single notion. Today, increasingly sophisticated information requires new correctness criteria due to the limitation of classical serialisability theory which allows only a limited cooperation between its components. Several models relaxing the ACID (Atomicity, Consistency, Isolation, Durability) properties in a controlled manner have been developed. These approaches exploit separately the semantics properties of operations (object semantic approach) and application semantics (transaction interleaving approach). The notion of correctness can be refined with the help of the two previous approaches whilst increasing concurrency. In this paper, we will the gap between transaction and object semantic correctness criteria. We define a new class of schedule called Multilevel Relative Serialisability (MLRS) to combine the two approaches. This class of schedule preserve correctness properties defined in terms of object and transaction semantics. We use ACTA formalism to express object consistency, transaction correctness and MLRS. This work merges existing /spl Lt/relaxed/spl Gt/ transaction models into a unified concept. This concept is useful for long-lived, cooperative and hierarchical transaction models.","PeriodicalId":275219,"journal":{"name":"Proceedings. 14th Symposium on Reliable Distributed Systems","volume":"145 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125851980","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Performance analysis of a regeneration-based dynamic voting algorithm 基于再生的动态投票算法的性能分析

Proceedings. 14th Symposium on Reliable Distributed Systems

Pub Date : 1995-09-13 DOI: 10.1109/RELDIS.1995.526227

Robert J. Hilderman, Howard J. Hamilton

RVC2 is a consistency control algorithm for replicated data objects in a distributed computing system. It is a dynamic voting algorithm which utilizes selective regeneration and recovery mechanisms for failed copies. Virtual copies which record information about the current state of a data object, but do not contain actual data, are used to reduce network and storage overhead. Experimental results for availability, storage cost, and message cost, obtained through simulation, are discussed. Our results show that the replacement of real copies with virtual copies has no significant impact on the availability of a data object. Neither does varying the generation threshold. We also show that high availability can be maintained without regeneration. We conclude that regeneration makes no significant contribution to the high availability of RVC2.

RVC2是分布式计算系统中复制数据对象的一致性控制算法。它是一种动态投票算法，对失败副本采用选择性再生和恢复机制。虚拟副本记录数据对象的当前状态信息，但不包含实际数据，用于减少网络和存储开销。讨论了通过仿真得到的可用性、存储成本和消息成本的实验结果。我们的结果表明，用虚拟副本替换真实副本对数据对象的可用性没有显著影响。改变生成阈值也不会。我们还展示了无需再生即可维持高可用性。我们得出结论，再生对RVC2的高可用性没有显著贡献。

引用次数: 2

A new deadlock detection algorithms for distributed real-time database systems 一种新的分布式实时数据库死锁检测算法

Proceedings. 14th Symposium on Reliable Distributed Systems

Pub Date : 1995-09-13 DOI: 10.1109/RELDIS.1995.526222

C. Yeung, S. Hung

Recently the concurrency control issue of real-time transactions is gaining increasing attention of researchers in the database community. One of the major design issue in concurrency control of real-time transactions is the resolution of local as well as distributed deadlocks while at the same time meeting the timing requirements of the transactions. In this paper, a new deadlock detection algorithm specially designed for distributed real-time database systems is proposed. The performance of the proposed algorithm is evaluated through extensive simulation experiments. Studies have also been carried out to compare the performance of the real-time deadlock detection algorithm with a non real-time algorithm for both firm and soft real-time transactions. Results indicated that the real-time deadlock detection algorithm performs better than the non real-lime deadlock detection algorithm. Results also indicated that the performance of the new algorithm is substantially better for soft real-time than that of firm real-time systems.

近年来，实时事务的并发控制问题越来越受到数据库界研究人员的关注。实时事务并发控制的主要设计问题之一是解决本地和分布式死锁，同时满足事务的时间要求。本文提出了一种专门针对分布式实时数据库系统的死锁检测算法。通过大量的仿真实验对该算法的性能进行了评价。还进行了研究，比较实时死锁检测算法与非实时算法在硬实时和软实时事务中的性能。结果表明，实时死锁检测算法优于非实时死锁检测算法。结果还表明，新算法的软实时性能明显优于硬实时系统。

引用次数: 6

Supporting semantics-based transaction processing in mobile database applications 支持移动数据库应用程序中基于语义的事务处理

Proceedings. 14th Symposium on Reliable Distributed Systems

Pub Date : 1995-09-13 DOI: 10.1109/RELDIS.1995.518721

Gary D. Walborn, Panos K. Chrysanthis

Advances in computer and telecommunication technologies have made mobile computing a reality. However, greater mobility implies a more tenuous network connection and a higher rate of disconnection. In order to tolerate disconnections as well as to reduce the delays and cost of wireless communication, it is necessary to support autonomous mobile operations on data shared by stationary hosts. This would allow the part of a computation executing on a mobile host to continue executing while the mobile host is not connected to the network. In this paper, we examine whether object semantics can be exploited to facilitate autonomous and disconnected operation in mobile database applications. We define the class of fragmentable objects which may be split among a number of sites, operated upon independently at each site, and then recombined in a semantically consistent fashion. A number of objects with such characteristics are presented and an implementation of fragmentable stacks is shown and discussed.

计算机和电信技术的进步使移动计算成为现实。然而，更大的移动性意味着更脆弱的网络连接和更高的断开率。为了容忍断开连接以及减少无线通信的延迟和成本，有必要支持固定主机共享数据的自主移动操作。这将允许在移动主机上执行的计算部分在移动主机未连接到网络时继续执行。在本文中，我们研究了对象语义是否可以被利用来促进移动数据库应用程序中的自主和断开连接操作。我们定义了一类可分割的对象，这些对象可以在许多站点之间分裂，在每个站点独立操作，然后以语义一致的方式重新组合。提出了许多具有这种特征的对象，并展示和讨论了可分段堆栈的实现。

引用次数: 120

Failure detection algorithms for a reliable execution of parallel programs 故障检测算法的可靠执行并行程序

Proceedings. 14th Symposium on Reliable Distributed Systems

Pub Date : 1995-09-13 DOI: 10.1109/RELDIS.1995.526230

S. Chabridon, E. Gelenbe

We report on the design and simulation of novel algorithms which will ensure that application software runs correctly on a MIMD system in which processing units (PU) can fail. The effect of these algorithms is evaluated for random task graphs using simulation as failure rates increase. An example of a specific application is also examined (the Fast Fourier Transform) for which we construct the task graph and then simulate its execution under various values of the failure rates of processors.

我们报告了新算法的设计和仿真，这些算法将确保应用软件在处理单元(PU)可能出现故障的MIMD系统上正确运行。随着故障率的增加，这些算法对随机任务图的效果进行了评估。还研究了一个特定应用程序的示例(快速傅立叶变换)，我们为此构建了任务图，然后模拟其在不同处理器故障率值下的执行。

引用次数: 16

An integer programmimg approach for assigning votes in a distributed system 在分布式系统中分配选票的整数编程方法

Proceedings. 14th Symposium on Reliable Distributed Systems

Pub Date : 1995-09-13 DOI: 10.1109/RELDIS.1995.526220

D. Venkaiah, P. Jalote

Voting is a general approach to maintain consistency of replicated data under node failures and network partitions. In voting, each node as assigned a particular number of votes, and any group with majority of votes can perform operations. Votes assigned to the nodes have a significant impact on the performance of a voting system. In this report, we propose an integer programming approach for determining the vote assignment for maximizing the throughput. We use Monte-Carlo simulation to find the most likely groups formed due to partition failures and use these groups to formulate vote assignment as an integer programming problem. We have developed a tool called vote assignment tool (VAT) that implements this approach. VAT takes as input the configuration of the network, and after formulating the problem as integer programming exercise, solves it to output a vote assignment. We have tried this approach for different networks and have found that in many cases this approach assigns votes equivalent to or better than the best vote assignment given by the various heuristics.

投票是在节点故障和网络分区情况下保持复制数据一致性的一种通用方法。在投票中，每个节点被分配特定数量的选票，任何拥有多数选票的组都可以执行操作。分配给节点的选票对投票系统的性能有重大影响。在本报告中，我们提出了一种整数规划方法来确定投票分配以最大化吞吐量。我们使用蒙特卡罗模拟来找到由于分区失败而形成的最可能的组，并使用这些组将投票分配表述为整数规划问题。我们开发了一种工具，称为投票分配工具(VAT)，它实现了这种方法。增值税将网络的配置作为输入，将问题公式化为整数规划练习后，将其求解以输出投票分配。我们已经在不同的网络上尝试了这种方法，并发现在许多情况下，这种方法分配的选票等于或优于各种启发式给出的最佳投票分配。

引用次数: 3

A hierarchy of totally ordered multicasts 完全有序的组播的层次结构

Proceedings. 14th Symposium on Reliable Distributed Systems

Pub Date : 1995-09-13 DOI: 10.1109/RELDIS.1995.526218

U. Wilhelm, A. Schiper

The increased interest in protocols that provide a total order on message delivery has led to several different definitions of total order. In this paper we investigate these different definitions and propose a hierarchy that helps to better understand the implications of the different possibilities in terms of guarantees and communication cost. We identify two definitions: weak total order and strong total order, which are at the extremes of the proposed hierarchy, and incorporate them into a consistent design. Finally, we propose high-level algorithms based on a virtually synchronous communication environment that implement the given definitions.

对提供消息传递总顺序的协议越来越感兴趣，这导致了对总顺序的几种不同定义。在本文中，我们研究了这些不同的定义，并提出了一个层次结构，以帮助更好地理解在保证和通信成本方面不同可能性的含义。我们确定了两个定义:弱总顺序和强总顺序，这是在提出的层次结构的极端，并将它们合并到一个一致的设计中。最后，我们提出了基于虚拟同步通信环境的高级算法，实现了给定的定义。

引用次数: 51

Maximum and minimum consistent global checkpoints and their applications 最大和最小一致的全局检查点及其应用程序

Proceedings. 14th Symposium on Reliable Distributed Systems

Pub Date : 1995-09-13 DOI: 10.1109/RELDIS.1995.526216

Yi-Min Wang

This paper considers the problem of constructing the maximum and the minimum consistent global checkpoints that contain a target set of checkpoints, and identify it as a generic issue in recovery-related applications. We formulate the problem as a reachability analysis problem on a directed rollback-dependency graph, and develop efficient algorithms to calculate the two consistent global checkpoints for both general nondeterministic executions and piecewise deterministic executions. We also demonstrate that the approach provides a generalization and unifying framework for many existing and potential applications including software error recovery, mobile computing recovery, parallel debugging and output commits.

本文研究了包含目标检查点集的最大和最小一致全局检查点的构造问题，并将其视为恢复相关应用中的一个通用问题。我们将该问题表述为有向回滚依赖图上的可达性分析问题，并开发了有效的算法来计算一般不确定性执行和分段确定性执行的两个一致的全局检查点。我们还证明了该方法为许多现有和潜在的应用提供了一个泛化和统一的框架，包括软件错误恢复、移动计算恢复、并行调试和输出提交。

引用次数: 36

On the design of systems of cooperating functional processes 协同功能流程系统的设计

Proceedings. 14th Symposium on Reliable Distributed Systems

Pub Date : 1995-09-13 DOI: 10.1109/RELDIS.1995.518723

Claus Aßmann, W. Kluge

This paper describes a design concept for systems of cooperating distributed processes based on a variant of coloured Petri-nets. It cleanly separates graphical specification of processes and their interaction (or communication) from the algorithmic specifications of the computations that need to be performed by the individual processes. Designing complex process systems is aided by abstractions similar to those that are available in programming languages. In conjunction with a small set of well-defined interaction schemes for process communication it ensures well-behaving systems largely by construction. Essential invariance properties of small subsystems which in incremental steps may either be verified by formal methods or validated by simulation are not corrupted when embedding them in the context of larger systems. The paper focuses particularly on the construction of large systems by recursive abstractions of small net templates which, at execution time, may be recursively expanded to distribute application problems evenly over several processing sites for concurrent processing.

本文提出了一种基于彩色petri网的协作分布式进程系统的设计思想。它将流程及其交互(或通信)的图形化规范与需要由单个流程执行的计算的算法规范清晰地分离开来。设计复杂的过程系统可以借助于类似于编程语言中可用的抽象。它与用于过程通信的一组定义良好的交互方案相结合，主要通过构造来确保系统的良好行为。小型子系统的基本不变性可以通过形式方法或模拟方法进行验证，但在将其嵌入到更大的系统环境中时不会受到破坏。本文特别关注通过对小型网络模板的递归抽象来构建大型系统，这些模板在执行时可以递归地扩展以将应用程序问题均匀地分布在多个处理站点上以进行并发处理。

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings. 14th Symposium on Reliable Distributed Systems

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀