Proceedings Sixth International Parallel Processing Symposium最新文献

英文中文

Parallel implementation of divide-and-conquer algorithms on binary de Bruijn networks 二进制de Bruijn网络上分治算法的并行实现

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223064

Xiaoxiong Zhong, S. Rajopadhye, V. Lo

Studies the problem of parallel implementation of divide-and-conquer algorithms on binary de Bruijn network using a temporal binomial tree (rather than the usual binary tree) computation structure. Two cases of message volumes are considered: (i) uniform, and (ii) logarithmically decreasing (increasing) weights. A single mapping is proposed for both cases. It has average extra dilation 1 and is communication link contention-free. A lower bound for the total extra dilation of any mapping from uniform-weighted binomial tree to an arbitrary degree-4 network is also developed to show that the mapping is asymptotically optimal with respective to the average extra dilation. The implementation is well suited to a binary de Bruijn network with a wormhole or circuit switching communication scheme.<>

采用时间二叉树(而不是通常的二叉树)计算结构，研究了二分de Bruijn网络上分治算法的并行实现问题。考虑消息量的两种情况:(i)均匀，和(ii)对数减少(增加)权重。对于这两种情况，建议使用单个映射。它具有平均额外膨胀1，并且通信链路无争用。给出了从等权二叉树到任意4次网络的任意映射的总额外展开的下界，证明了该映射相对于平均额外展开是渐近最优的。该实现非常适合于具有虫洞或电路交换通信方案的二进制de Bruijn网络

引用次数: 6

Distributed consensus in semi-synchronous systems 半同步系统中的分布式一致性

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222994

P. Berman, A. Bharali

The Distributed consensus problem assumes that all processors in the system have some initial values; the goal is to make all non-faulty processors agree on one of these values. This paper investigates the time needed to reach consensus in a partially synchronous model with omission failures. In this model, the processors have no direct knowledge about time, but the time between consecutive steps of each processor is always between two known constants c/sub 1/ and c/sub 2/; the ratio C=/sup c2///sub c1/ measures the timing uncertainty in the system. Moreover, messages are delivered within time d. This paper provides an improved protocol for the above problem. When the majority of the processors are fault-free, the protocol achieves consensus in time 3( phi +1)d+Cd, where phi is the actual number of faults in a specific execution of the protocol. This allows an increase in efficiency up to 25% over the existing protocol which requires time 4( phi +1)d+Cd.<>

分布式共识问题假设系统中的所有处理器都有一些初始值;目标是使所有没有故障的处理器都同意这些值中的一个。本文研究了部分同步模型中存在遗漏故障时达成共识所需的时间。在该模型中，处理器不直接知道时间，但每个处理器连续步骤之间的时间总是在两个已知常数c/下标1/和c/下标2/之间;比值C=/sup c2/// sup c1/表示系统的时序不确定性。针对上述问题，本文提出了一种改进的协议。当大多数处理器无故障时，协议在3(phi +1)d+Cd时间内达成共识，其中phi是协议在特定执行过程中的实际故障数。这使得效率比现有协议提高了25%，而现有协议需要时间4(phi +1)d+Cd。

引用次数: 5

Parallel implementation of the auction algorithm on the Intel hypercube 拍卖算法在Intel超立方体上的并行实现

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223005

N. Bagherzadeh, K. Hawk

The authors present their experience in executing the auction algorithm on an iPSC/860 hypercube multiprocessor. They show the performance of the algorithm under synchronous and asynchronous computation models. In order to reduce the number of iterations for this algorithm and effectively increase the inherent parallelism in the auction algorithm, they propose and test a new technique called gamma -scaling.<>

作者介绍了在iPSC/860超立方体多处理器上执行拍卖算法的经验。分别展示了该算法在同步和异步计算模型下的性能。为了减少该算法的迭代次数，并有效地提高拍卖算法的固有并行性，他们提出并测试了一种名为gamma -scaling的新技术

引用次数: 4

Fault-tolerant multiprocessor system routing using incomplete diagnostic information 使用不完整诊断信息的容错多处理器系统路由

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223013

D. Blough, S. Najand

Fault-tolerant routing algorithms in multiprocessor systems utilize diagnostic information in selecting paths for messages. In many situations, only incomplete, or partial, diagnostic information is available for this purpose. The authors present algorithms for achieving two forms of diagnosis, known as k-reachability diagnosis and k-neighborhood diagnosis which provide partial diagnostic information. They compare, both analytically and through experiments conducted on an Intel iPSC/2 hypercube the performance and overhead of these two algorithms. They also present a routing algorithm that successfully routes messages between connected non-faulty nodes in systems of arbitrary topology containing an arbitrary number of faults. The performance of the algorithm is shown to be optimal when k=n-1 and within a factor of two of optimal, in the worst case, when k=1.<>

多处理器系统中的容错路由算法利用诊断信息为消息选择路径。在许多情况下，只有不完整或部分的诊断信息可用于此目的。作者提出了实现两种诊断形式的算法，即提供部分诊断信息的k-可达性诊断和k-邻域诊断。他们通过分析和在Intel iPSC/2超立方体上进行的实验，比较了这两种算法的性能和开销。他们还提出了一种路由算法，可以在包含任意数量故障的任意拓扑系统中成功地在连接的无故障节点之间路由消息。当k=n-1时，该算法的性能是最优的，在最坏的情况下，当k=1时，该算法的性能在最优的两个因子内

引用次数: 5

Optimal algorithms for the vertex updating problem of a minimum spanning tree 最小生成树顶点更新问题的最优算法

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223028

Donald B. Johnson, P. Metaxas

The vertex updating problem for a minimum spanning tree (MST) is defined as follows: Given a graph G=(V,E/sub G/) and its MST T, update T when a new vertex z is introduced along with weighted edges that connect z with the vertices of G. The authors present a set of rules that, together with a valid tree-contraction schedule are used to produce simple optimal parallel algorithms that run in O(log n) parallel time using n/lgn EREW PRAMs where n= mod V mod . These rules can also be used to derive simple linear-time sequential algorithms for the same problem. It is also shown how this solution can be used to solve the multiple vertex updating problem: Update a given MST when k new vertices are introduced simultaneously. This problem is solved in O(lgk.lgn) parallel time using /sub lgk.lgn//sup k.n/ EREW PRAM processors.<>

顶点的最小生成树(MST)更新问题定义如下:给定一个图G = (V, E / sub G /)及其MST T, T更新当一个新的顶点z介绍以及加权边缘连接z与G的顶点作者提供的一组规则,连同一个有效tree-contraction时间表是用于生产简单的最优运行的并行算法在O (log n)平行时间使用n / lgn EREW婴儿车V mod n =国防部。这些规则也可以用来为同样的问题推导简单的线性时间序列算法。还展示了如何使用此解决方案来解决多顶点更新问题:当同时引入k个新顶点时更新给定的MST。使用/sub lgk在O(lgk.lgn)并行时间内解决了这个问题。//sup k.n/ EREW PRAM处理器

引用次数: 9

An improved multiple-path deadlock-free routing algorithm in binary hypercubes 二进制超立方体中改进的多路径无死锁路由算法

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223000

Qiang Li

This paper presents a multiple-path deadlock-free routing algorithm in direct binary hypercubes which is an improved version of a previously published algorithm by the author (1991). Between two nodes of distance k, the previous algorithm provides k disjoint paths in one direction and one path in the other. The direction with one path is a performance bottleneck. The new algorithm adds one more disjoint path to the narrow direction using buffer management technique, and preserves the deadlock-free property. Although only one path is added, simulation results presented in this paper show a significant performance improvement since the added path almost doubles the capacity of the bottleneck.<>

本文提出了一种直接二进制超立方体的多路径无死锁路由算法，该算法是作者(1991)先前发表的算法的改进版本。在距离为k的两个节点之间，前一算法在一个方向上提供了k条不相交的路径，在另一个方向上提供了一条路径。只有一条路径的方向是性能瓶颈。该算法利用缓冲区管理技术在窄方向上增加了一条不相交路径，并保持了无死锁的特性。虽然只增加了一条路径，但本文的仿真结果显示了显著的性能改进，因为增加的路径几乎使瓶颈的容量增加了一倍。

引用次数: 0

A more efficient message-optimal algorithm for distributed termination detection 一种更有效的分布式终端检测消息优化算法

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222991

T. Lai, Y. Tseng, Xuefeng Dong

Termination detection is a fundamental problem in distributed computing. Many algorithms have been proposed, but only the S. Chandrasekaran and S. Venkatesan (CV) algorithm (1990) is known to be optimal in worst-case message complexity. This optimal algorithm, however, has several undesirable properties. First, it always requires M'+2* mod E mod +n-1 control messages, whether it is worst case or best case, where M' is the number of basic messages issued by the underlying computation after the algorithm starts, mod E mod is the number of channels in the system, and n is the number of processes. Second, its worst-case detection delay is O(M'). In a message-intensive computation, that might not be tolerable. Third, the maximum amount of space needed by each process is O(M'), a quantity not known at compile time, making it necessary to use the more expensive dynamic memory allocation. Last, it works only for FIFO channels. This paper remedies these drawbacks, while keeping its strength. The authors propose an algorithm that requires M'+2(n-1) control messages in the worst case, but much fewer on the average, and in the best case, it uses only 2(n-1) control messages, no matter how large M' is.<>

终端检测是分布式计算中的一个基本问题。已经提出了许多算法，但已知只有S. Chandrasekaran和S. Venkatesan (CV)算法(1990)在最坏情况下是最优的。然而，这种最优算法有几个不理想的特性。首先，无论是最坏情况还是最佳情况，它总是需要M'+2* mod E mod +n-1条控制消息，其中M'为算法启动后底层计算发出的基本消息数，mod E mod为系统中的通道数，n为进程数。其次，其最坏情况检测延迟为O(M’)。在消息密集型计算中，这可能是不可容忍的。第三，每个进程所需的最大空间量是O(M')，这个量在编译时是未知的，因此有必要使用更昂贵的动态内存分配。最后，它只适用于FIFO通道。本文弥补了这些缺点，同时保持了其强度。作者提出了一种算法，在最坏的情况下，它需要M'+2(n-1)个控制消息，但平均来说要少得多，在最好的情况下，它只需要2(n-1)个控制消息，无论M'有多大。

{"title":"A more efficient message-optimal algorithm for distributed termination detection","authors":"T. Lai, Y. Tseng, Xuefeng Dong","doi":"10.1109/IPPS.1992.222991","DOIUrl":"https://doi.org/10.1109/IPPS.1992.222991","url":null,"abstract":"Termination detection is a fundamental problem in distributed computing. Many algorithms have been proposed, but only the S. Chandrasekaran and S. Venkatesan (CV) algorithm (1990) is known to be optimal in worst-case message complexity. This optimal algorithm, however, has several undesirable properties. First, it always requires M'+2* mod E mod +n-1 control messages, whether it is worst case or best case, where M' is the number of basic messages issued by the underlying computation after the algorithm starts, mod E mod is the number of channels in the system, and n is the number of processes. Second, its worst-case detection delay is O(M'). In a message-intensive computation, that might not be tolerable. Third, the maximum amount of space needed by each process is O(M'), a quantity not known at compile time, making it necessary to use the more expensive dynamic memory allocation. Last, it works only for FIFO channels. This paper remedies these drawbacks, while keeping its strength. The authors propose an algorithm that requires M'+2(n-1) control messages in the worst case, but much fewer on the average, and in the best case, it uses only 2(n-1) control messages, no matter how large M' is.<<ETX>>","PeriodicalId":340070,"journal":{"name":"Proceedings Sixth International Parallel Processing Symposium","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1992-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133386710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 25

A software tool for cellular mapping of discrete unitary transforms 离散酉变换的元胞映射软件工具

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223029

G. Miel, E. Yfantis

The paper describes a software tool that facilitates mapping onto array processors of a wide class of unitary transforms. The mapping formalism of the tool depends on matrix factorizations combined with abstract constructs that link the linear concepts to a model of the array's architecture. A prototype design of the tool is graphics-based and user-driven.<>

本文描述了一个软件工具，可以方便地将一类广泛的酉变换映射到阵列处理器上。该工具的映射形式化依赖于矩阵分解和将线性概念链接到数组体系结构模型的抽象构造。该工具的原型设计是基于图形和用户驱动的。

引用次数: 1

Quadtree building algorithms on an SIMD hypercube SIMD超立方体上的四叉树构建算法

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223077

O. Ibarra, M. Kim

Presents O(log n) time SIMD hypercube algorithms for transforming binary images to linear quadtrees and vice versa, where n is the size of the images as well as the number of hypercube nodes. The quadtree building algorithm, which generates the locational codes in preorder, is an improvement of a recently reported algorithm that runs in O(log/sup 2/n) time. The authors also give an optimal linear quadtree building algorithm which runs in T(n) time using n/sup 2//T(n) processors for log n>

提出了O(log n)时间SIMD超立方体算法，用于将二值图像转换为线性四叉树，反之亦然，其中n是图像的大小以及超立方体节点的数量。四叉树构建算法可以预先生成位置代码，是对最近报道的运行时间为O(log/sup 2/n)的算法的改进。作者还给出了一个最优的线性四叉树构建算法，该算法在T(n)时间内运行，使用n/sup 2//T(n)处理器，log n>

引用次数: 6

Optimal aspect ratio and number of separable row/column buses for mesh-connected parallel computers 网格连接并行计算机的最佳纵横比和可分离行/列总线数量

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223023

M. Serrano, B. Parhami

A two-dimensional mesh of PEs with separable row and column buses has been shown to be quite effective for semigroup, prefix, and a wide class of other parallel computations. The authors show how semigroup and prefix computations can be performed with the same asymptotic time complexity on meshes having separable buses for a subset of rows and columns. They find that with this basic arrangement, square grids are not optimal but that a hierarchical method of synthesizing large meshes builds optimal square meshes from rectangular submeshes. The time-complexity results are shown to correspond to those previously published when certain parameters of the design are fixed at special values.<>

具有可分离行和列总线的二维pe网格已被证明对半群、前缀和其他广泛的并行计算非常有效。作者展示了如何在对行和列的子集具有可分离总线的网格上以相同的渐近时间复杂度执行半群和前缀计算。他们发现，在这种基本布局下，正方形网格不是最优的，但一种综合大网格的分层方法可以从矩形子网格中构建最优的正方形网格。当设计的某些参数固定在特定值时，时间复杂度结果与先前公布的结果相对应

引用次数: 2

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings Sixth International Parallel Processing Symposium

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀