Proceedings of 16th International Conference on Distributed Computing Systems最新文献

英文中文

How to recover efficiently and asynchronously when optimism fails 当乐观情绪失败时，如何高效异步地恢复

Proceedings of 16th International Conference on Distributed Computing Systems

Pub Date : 1996-05-27 DOI: 10.1109/ICDCS.1996.507907

O. Damani, V. Garg

We propose a new algorithm for recovering asynchronously from failures in a distributed computation. Our algorithm is based on two novel concepts-a fault-tolerant vector clock to maintain causality information in spite of failures, and a history mechanism to detect orphan states and obsolete messages. These two mechanisms together with checkpointing and message-logging are used to restore the system to a consistent state after a failure of one or more processes. Our algorithm is completely asynchronous. It handles multiple failures, does not assume any message ordering, causes the minimum amount of rollback and restores the maximum recoverable state with low overhead. Earlier optimistic protocols lack one or more of the above properties.

提出了一种分布式计算中异步恢复故障的新算法。我们的算法基于两个新概念——一个容错矢量时钟，用于在故障情况下保持因果关系信息;一个历史机制，用于检测孤立状态和过时消息。这两种机制与检查点和消息日志一起用于在一个或多个进程失败后将系统恢复到一致状态。我们的算法完全异步。它处理多个故障，不假定任何消息排序，导致最小数量的回滚，并以低开销恢复最大可恢复状态。早期的乐观协议缺少上述一个或多个属性。

引用次数: 74

Reliable communication in cube-based multicomputers using safety vectors 基于多维数据集的多计算机安全矢量可靠通信

Proceedings of 16th International Conference on Distributed Computing Systems

Pub Date : 1996-05-27 DOI: 10.1109/ICDCS.1996.507914

Jie Wu

Reliable communication in cube-based multicomputers (including disconnected ones) using the safety vector concept is studied in this paper. In the proposed approach each node in a cube-based multicomputer of dimension n is associated with a safety vector of n binary numbers which is an approximated measure of the number and distribution of faults in the neighborhood. The safety vector of each node in an n-dimensional hypercube can be easily calculated through n-1 rounds of information exchange among neighboring nodes. Optimal unicasting between two nodes is guaranteed if the kth bit of the safety vector of the source node is one, where k is the Hamming distance between the source and the destination. An extended deadlock-free unicasting using virtual channels is also introduced.

本文研究了基于多维数据集的多计算机(包括未连接的多计算机)在安全矢量概念下的可靠通信。在该方法中，基于立方体的n维多计算机中的每个节点都与n个二进制数的安全向量相关联，该安全向量是邻域故障数量和分布的近似度量。n维超立方体中每个节点的安全向量可以通过相邻节点间的n-1轮信息交换轻松计算。如果源节点的安全向量的第k位为1，则保证两个节点之间的最优单播，其中k为源和目标之间的汉明距离。还介绍了使用虚拟信道的扩展无死锁单播。

引用次数: 6

Distributed priority queues on hypercube architectures 超多维数据集架构上的分布式优先级队列

Proceedings of 16th International Conference on Distributed Computing Systems

Pub Date : 1996-05-27 DOI: 10.1109/ICDCS.1996.508013

Sajal K. Das, M. C. Pinotti, F. Sarkar

We efficiently map a priority queue on the hypercube architecture in a load balanced manner, with no additional communication overhead. Two implementations for insert and deletemin operations are proposed on the single-port hypercube model. In a b-bandwidth, n-item priority queue in which every node contains b items in sorted order, the first implementation achieves optimal speed-up of O[min{log n, b(log n)/(log b+log log n)}] for inserting b pre-sorted items or deleting b smallest items, where b=O(n/sup 1/c/) with c>1. In particular, single insertion and deletion operations are cost-optimal and require O(log n/p+log p) time using O(log n/log log n) processors. The second implementation is more scalable since it uses a larger number of processors, and attains a 'nearly' optimal speed-up on the single-port hypercube. The insertion of log n pre-sorted items or the deletion of log n smallest items requires O(log log n)/sup 2/ time and O(log/sup 2/ n/log log n) processors. However, on the slightly more powerful pipelined hypercube model, we are able to reduce the time complexity to O(log log n) thus attaining optimal speed-up. To the best of our knowledge, our algorithms provide the first implementations of b-bandwidth distributed priority queues, which are load balanced and yet guarantee optimal speed-up.

我们以负载均衡的方式在超多维数据集架构上有效地映射优先级队列，没有额外的通信开销。在单端口超立方体模型上提出了插入和删除操作的两种实现。在带宽为b，项数为n的优先级队列中，每个节点按排序顺序包含b个项，对于插入b个预排序项或删除b个最小项，第一种实现获得了O[min{log n, b(log n)/(log b+log log n)}]的最优加速，其中b=O(n/sup 1/c/)，其中c>1。特别是，单个插入和删除操作是成本最优的，使用O(log n/log log n)个处理器需要O(log n/p+log p)时间。第二种实现更具可扩展性，因为它使用了更多的处理器，并且在单端口超立方体上实现了“近乎”最佳的加速。插入log n个预排序项或删除log n个最小项需要O(log log n)/sup 2/时间和O(log/sup 2/ n/log log n)个处理器。然而，在更强大的管道超立方体模型上，我们能够将时间复杂度降低到O(log log n)，从而获得最佳的加速。据我们所知，我们的算法提供了b带宽分布式优先级队列的第一个实现，它是负载平衡的，但保证了最佳的加速。

{"title":"Distributed priority queues on hypercube architectures","authors":"Sajal K. Das, M. C. Pinotti, F. Sarkar","doi":"10.1109/ICDCS.1996.508013","DOIUrl":"https://doi.org/10.1109/ICDCS.1996.508013","url":null,"abstract":"We efficiently map a priority queue on the hypercube architecture in a load balanced manner, with no additional communication overhead. Two implementations for insert and deletemin operations are proposed on the single-port hypercube model. In a b-bandwidth, n-item priority queue in which every node contains b items in sorted order, the first implementation achieves optimal speed-up of O[min{log n, b(log n)/(log b+log log n)}] for inserting b pre-sorted items or deleting b smallest items, where b=O(n/sup 1/c/) with c>1. In particular, single insertion and deletion operations are cost-optimal and require O(log n/p+log p) time using O(log n/log log n) processors. The second implementation is more scalable since it uses a larger number of processors, and attains a 'nearly' optimal speed-up on the single-port hypercube. The insertion of log n pre-sorted items or the deletion of log n smallest items requires O(log log n)/sup 2/ time and O(log/sup 2/ n/log log n) processors. However, on the slightly more powerful pipelined hypercube model, we are able to reduce the time complexity to O(log log n) thus attaining optimal speed-up. To the best of our knowledge, our algorithms provide the first implementations of b-bandwidth distributed priority queues, which are load balanced and yet guarantee optimal speed-up.","PeriodicalId":159322,"journal":{"name":"Proceedings of 16th International Conference on Distributed Computing Systems","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114459095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

Communication compilation for unreliable networks 针对不可靠网络的通信编译

Proceedings of 16th International Conference on Distributed Computing Systems

Pub Date : 1996-05-27 DOI: 10.1109/ICDCS.1996.507916

N. Islam, Amitabh Dave, R. Campbell

Parallel programs running on top of generic protocols (e.g. TCP) in a cluster of workstations often do not perform or scale as well as one would expect. One reason for this is that both the performance and scalability of parallel applications are highly dependent on the speed of communication, yet the generic protocols used to guarantee reliable message delivery add unnecessary overhead which degrades the performance of the parallel application. The main thesis we explore in this paper is that it is possible to use knowledge of application behavior to design protocols that are more efficient. In particular, we investigate automatic techniques for generating optimized application-specific network protocols for parallel applications running on unreliable networks. Our algorithms assume that the application communication can be represented by a context free grammar. Such algorithms form the basis for a communication compiler.

在工作站集群中，运行在通用协议(如TCP)之上的并行程序通常不能像人们期望的那样执行或扩展。原因之一是并行应用程序的性能和可伸缩性高度依赖于通信速度，而用于保证可靠消息传递的通用协议增加了不必要的开销，从而降低了并行应用程序的性能。我们在本文中探讨的主要论点是，利用应用程序行为的知识来设计更有效的协议是可能的。特别是，我们研究了为运行在不可靠网络上的并行应用程序生成优化的特定于应用程序的网络协议的自动技术。我们的算法假设应用程序通信可以用上下文无关的语法表示。这样的算法构成了通信编译器的基础。

引用次数: 5

Atomic recovery units: failure atomicity for logical disks 原子恢复单元:逻辑盘的故障原子性

Proceedings of 16th International Conference on Distributed Computing Systems

Pub Date : 1996-05-27 DOI: 10.1109/ICDCS.1996.507898

R. Grimm, Wilson C. Hsieh, W. D. Jonge, M. Kaashoek

Atomic recovery units (ARUs) are a mechanism that allows several logical disk operations to be executed as a single atomic unit with respect to failures. For example, ARUs can be used during file creation to update several pieces of file meta-data atomically. ARUs simplify systems, as they isolate issues of atomicity within the logical disk system, ARUs are designed as part of the Logical Disk (LD), which provides an interface to disk storage that separates file and disk management by using logical block numbers and block lists. This paper discusses the semantics of concurrent ARUs, as well as the concurrency control they require. A prototype implementation in a log-structured logical disk system is presented and evaluated. The performance evaluation shows that the run-time overhead to support concurrent ARUs is negligible for Read and Write operations, and small but pronounced for file creation (4.0%-7.2%) and deletion (17.9%-20.5%) which mainly manipulate meta-data. The low overhead (when averaged over file creation, writing, reading, and deletion) for concurrent ARUs shows that issues of atomicity can be successfully isolated within the disk system.

原子恢复单元(ARUs)是一种机制，它允许多个逻辑磁盘操作作为单个原子单元执行，以应对故障。例如，可以在文件创建期间使用ARUs自动更新文件元数据的几个片段。aro简化了系统，因为它们隔离了逻辑磁盘系统中的原子性问题，aro被设计为逻辑磁盘(LD)的一部分，LD提供了一个磁盘存储接口，通过使用逻辑块编号和块列表分离文件和磁盘管理。本文讨论了并发ARUs的语义，以及它们所需要的并发控制。提出并评价了日志结构逻辑磁盘系统的原型实现。性能评估表明，支持并发ARUs的运行时开销对于读和写操作来说可以忽略不计，对于主要操作元数据的文件创建(4.0%-7.2%)和删除(17.9%-20.5%)来说很小，但明显。并发ARUs的低开销(按文件创建、写入、读取和删除的平均值计算)表明，可以在磁盘系统中成功地隔离原子性问题。

{"title":"Atomic recovery units: failure atomicity for logical disks","authors":"R. Grimm, Wilson C. Hsieh, W. D. Jonge, M. Kaashoek","doi":"10.1109/ICDCS.1996.507898","DOIUrl":"https://doi.org/10.1109/ICDCS.1996.507898","url":null,"abstract":"Atomic recovery units (ARUs) are a mechanism that allows several logical disk operations to be executed as a single atomic unit with respect to failures. For example, ARUs can be used during file creation to update several pieces of file meta-data atomically. ARUs simplify systems, as they isolate issues of atomicity within the logical disk system, ARUs are designed as part of the Logical Disk (LD), which provides an interface to disk storage that separates file and disk management by using logical block numbers and block lists. This paper discusses the semantics of concurrent ARUs, as well as the concurrency control they require. A prototype implementation in a log-structured logical disk system is presented and evaluated. The performance evaluation shows that the run-time overhead to support concurrent ARUs is negligible for Read and Write operations, and small but pronounced for file creation (4.0%-7.2%) and deletion (17.9%-20.5%) which mainly manipulate meta-data. The low overhead (when averaged over file creation, writing, reading, and deletion) for concurrent ARUs shows that issues of atomicity can be successfully isolated within the disk system.","PeriodicalId":159322,"journal":{"name":"Proceedings of 16th International Conference on Distributed Computing Systems","volume":"133 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125057406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 19

Group routing without group routing tables 没有组路由表的组路由

Proceedings of 16th International Conference on Distributed Computing Systems

Pub Date : 1996-05-27 DOI: 10.1109/ICDCS.1996.507968

J. Cobb, M. Gouda

We present a group routing protocol for a network of processes. The task of the protocol is to route data messages to each member of a process group. To this end, a tree of processes is constructed in the network, ensuring each group member is included in the tree. To build this tree, the group routing protocol relies upon the unicast routing tables of each process. Thus, group routing is a composition of a unicast routing protocol, whose detailed behavior is unknown but its basic properties are given, and a protocol that builds a group tree based upon the unicast routing tables. The design of the group routing protocol is presented in three steps. First, a basic group routing protocol is presented and proven correct. Then, the protocol is refined twice, strengthening its properties with each refinement. The final protocol has the property of adapting the group tree to changes in the unicast routing tables without compromising the integrity of the group tree, even in the presence of unicast routing loops.

提出了一种进程网络的组路由协议。协议的任务是将数据消息路由到进程组的每个成员。为此，在网络中构建进程树，确保每个组成员都包含在树中。为了构建这个树，组路由协议依赖于每个进程的单播路由表。因此，组路由是单播路由协议和基于单播路由表构建组树的协议的组合。单播路由协议的详细行为是未知的，但它的基本属性是已知的。分组路由协议的设计分为三个步骤。首先，提出了一种基本的组路由协议，并证明了其正确性。然后，对协议进行两次细化，每次细化都增强了协议的性能。最终协议具有使组树适应单播路由表中的变化而不损害组树完整性的特性，即使在存在单播路由循环的情况下也是如此。

引用次数: 1

Making trust explicit in distributed commerce transactions 在分布式商业事务中明确信任

Proceedings of 16th International Conference on Distributed Computing Systems

Pub Date : 1996-05-27 DOI: 10.1109/ICDCS.1996.507925

Steven P. Ketchpel, H. Garcia-Molina

In a distributed environment where nodes are independently motivated, many transactions or commercial exchanges may be stymied due to a lack of trust between the participants. The addition of trusted intermediaries may facilitate some exchanges, but others are still problematic. We introduce a language for specifying these commercial exchange problems, and sequencing graphs, a formalism for determining whether a given exchange may occur We also present an algorithm for generating a feasible execution sequence of pairwise exchanges between parties (when it exists). Indemnities may be offered to facilitate previously infeasible transactions. We show when and how they enable commercial transactions.

在节点独立驱动的分布式环境中，由于参与者之间缺乏信任，许多交易或商业交换可能会受阻。可信中介的增加可能会促进一些交易，但其他交易仍然存在问题。我们引入了一种用于指定这些商业交换问题的语言，以及排序图，一种用于确定给定交换是否可能发生的形式化方法。我们还提出了一种算法，用于生成各方之间成对交换的可行执行序列(当它存在时)。可以提供赔偿以促进以前不可行的交易。我们将展示它们何时以及如何实现商业交易。

引用次数: 92

Energy efficient filtering of nonuniform broadcast 非均匀广播的节能滤波

Proceedings of 16th International Conference on Distributed Computing Systems

Pub Date : 1996-05-27 DOI: 10.1109/ICDCS.1996.508001

K. Tan, J. Yu

In a wireless environment, information is broadcast on communication channels to clients using powerful, battery-operated palmtops. To conserve the usage of energy, the information to be broadcast must be organized so that the client can selectively tune in at the desirable portion of the broadcast. Most of the existing work focus on uniform broadcast. However very often, a small amount of information is more frequently accessed by a large number of clients while the remainder are less in demand. This nonuniform access pattern poses several new issues. In this paper we examine these issues and look at how a nonuniform broadcast can be organized for selective tuning by the clients. We propose several new indexing schemes to facilitate selective tuning. A performance study is conducted to study and demonstrate the effectiveness of the proposed schemes.

在无线环境中，信息通过通信渠道传播给使用功能强大、由电池供电的掌上电脑的客户端。为了节省能源的使用，必须对要广播的信息进行组织，以便客户端可以选择性地收听广播的所需部分。现有的工作大多集中在统一广播上。然而，通常情况下，大量客户端更频繁地访问少量信息，而其余信息的需求则较少。这种不统一的访问模式带来了几个新问题。在本文中，我们将研究这些问题，并研究如何组织非统一广播以供客户端选择性调优。我们提出了几个新的索引方案，以方便选择性调优。我们进行了一项绩效研究，以研究和证明拟议计划的有效性。

引用次数: 43

An approach for constructing mobile applications using service proxies 一种使用服务代理构建移动应用程序的方法

Proceedings of 16th International Conference on Distributed Computing Systems

Pub Date : 1996-05-27 DOI: 10.1109/ICDCS.1996.508025

T. Nakajima, Akihiro Hokimoto, K. Kurihara

In this paper, we propose software architecture using a notion of service proxies, that is a new framework for constructing applications in mobile computing environment. Our framework especially takes into account applications accessing services on Internet such as WWW browsers and MBONE conference tools. In our framework, an application is partitioned into two pieces, one piece runs on a mobile computer, and another piece runs on a stationary computer, where they are connected by wireless networks that may be replaced on the fly. The piece on a stationary computer is called a service proxy. The service proxy filters or caches data from servers before transmitting the data to the piece on a mobile computer. These two pieces are constructed by composing small objects whose composition can be dynamically reconfigured by adding or removing replaceable devices. The paper also presents a toolkit using our framework for building mobile applications easily.

本文提出了一种基于服务代理的软件体系结构，这是一种构建移动计算环境下应用程序的新框架。我们的框架特别考虑了在Internet上访问服务的应用程序，如WWW浏览器和MBONE会议工具。在我们的框架中，应用程序被划分为两个部分，一部分在移动计算机上运行，另一部分在固定计算机上运行，它们通过无线网络连接，无线网络可以随时更换。固定计算机上的部分称为服务代理。服务代理在将数据传输到移动计算机上的部件之前过滤或缓存来自服务器的数据。这两个部分是通过组合小对象来构建的，这些小对象的组合可以通过添加或删除可替换的设备来动态地重新配置。本文还介绍了一个使用我们的框架轻松构建移动应用程序的工具包。

引用次数: 67

A new token passing distributed mutual exclusion algorithm 一种新的令牌传递分布式互斥算法

Proceedings of 16th International Conference on Distributed Computing Systems

Pub Date : 1996-05-27 DOI: 10.1109/ICDCS.1996.508024

S. Banerjee, Panos K. Chrysanthis

Eliminating interference between concurrently executing activities through mutual exclusion is one of the most fundamental problems in computer systems. The problem of mutual exclusion in a distributed system is especially interesting owing to the lack of global knowledge in the presence of variable communication delays. In this paper, a new token-based distributed mutual exclusion algorithm is proposed. The algorithm incurs approximately three messages at high loads, irrespective of the number of nodes N in the system. At low loads, it requires approximately N messages. The paper also addresses failure recovery issues, such as token loss.

通过互斥消除并发执行活动之间的干扰是计算机系统中最基本的问题之一。由于在存在可变通信延迟的情况下缺乏全局知识，分布式系统中的互斥问题特别有趣。本文提出了一种新的基于令牌的分布式互斥算法。该算法在高负载下产生大约三条消息，而与系统中的节点数N无关。在低负载情况下，它大约需要N条消息。本文还讨论了故障恢复问题，例如令牌丢失。

引用次数: 20

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of 16th International Conference on Distributed Computing Systems

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀