[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation: Latest Articles


Towards efficient parallelizations of a computer algebra algorithm

[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1992-10-19 DOI: 10.1109/FMPC.1992.234903

P. Loustaunau, P.Y. Wang

The authors summarize the results of a preliminary study that examines the feasibility of implementing computer algebra systems on massively parallel single-instruction multiple-data architectures. On serial computers, these systems rely on B. Buchberger's (1970, 1985) algorithm for computing Gröbner bases. A parallelization of this algorithm that addresses the potential growth in the number of polynomials that can be generated during the computation is proposed. The parallel algorithm was implemented on a Connection Machine CM-200 System. The experimental results obtained for seven test problems are evaluated. The results of this study provide insights into ongoing research to develop more efficient parallel algorithms for finding Gröbner bases.

Citations: 2

Benchmarking the CM-5 multicomputer

[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1992-10-19 DOI: 10.1109/FMPC.1992.234900

Z. Bozkus, S. Ranka, G. Fox

The authors study the performance of the CM-5 multiprocessor. They provide a number of benchmarks for its communication and computation performance. Many operations, such as scans and global reductions, can be performed using special hardware available on the CM-5, and these operations have been benchmarked. The authors also describe how to embed a mesh and a hypercube on a CM-5 architecture and provide timings for some mesh and hypercube communication primitives on the CM-5.
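For readers unfamiliar with the scan and global-reduction operations mentioned above, the following is a minimal software sketch of their semantics. It is not CM-5 code and says nothing about how the CM-5 hardware implements them; the function names `inclusive_scan` and `global_reduce` are hypothetical, and the doubling-stride (Hillis-Steele style) schedule is one common data-parallel formulation chosen here for illustration.

```python
from operator import add


def inclusive_scan(values, op=add):
    """Inclusive scan with doubling strides, simulated sequentially.

    Each pass of the while loop stands for one data-parallel step in which
    every position i >= stride combines its value with position i - stride.
    The operator `op` is assumed to be associative.
    """
    result = list(values)
    stride = 1
    while stride < len(result):
        previous = list(result)  # every "processor" reads before any writes
        for i in range(stride, len(result)):
            result[i] = op(previous[i - stride], previous[i])
        stride *= 2
    return result


def global_reduce(values, op=add):
    """Global reduction of a non-empty sequence: last element of its scan."""
    return inclusive_scan(values, op)[-1]


print(inclusive_scan([3, 1, 4, 1, 5, 9, 2, 6]))  # [3, 4, 8, 9, 14, 23, 25, 31]
print(global_reduce([3, 1, 4, 1, 5, 9, 2, 6]))   # 31
```

The while loop runs about log2(n) times for n values, which is why scans are attractive primitives on machines that can perform each pass in parallel.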

Citations: 46

Off-line permutation scheduling on circuit-switched fixed routing networks

[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1992-10-19 DOI: 10.1109/FMPC.1992.234935

A. Youssef

The problem of offline permutation scheduling on linear arrays, rings, hypercubes, and two-dimensional arrays, assuming the CSFR (circuit-switched fixed routing) model, is examined. Optimal permutation scheduling involves finding a minimum number of subsets of nonconflicting source-destination paths, where every subset of paths can be established to run in one pass. Optimal permutation scheduling is shown to be linear on linear arrays and NP-complete on rings. On hypercubes, the problem is also NP-complete. However, the author discusses an O(N log N) algorithm that routes any permutation in two passes if the model is relaxed to allow two routing rules, the e-cube rule and the e⁻¹-cube rule. This complexity is reduced to O(N) hypercube-parallel time. An O(N log² N) bipartite-matching-based algorithm designed to schedule any permutation on p × q meshes/tori in q passes is considered.
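To make the scheduling problem concrete, here is a small sketch of fixed e-cube routing on a hypercube together with a greedy first-fit grouping of paths into conflict-free passes. Several things here are assumptions for illustration and not taken from the paper: the e-cube rule is taken to correct differing address bits from dimension 0 upward, two paths are taken to conflict when they share a directed link, and the greedy grouping is a simple heuristic that only gives an upper bound on the number of passes (it is not the author's algorithm). All function names are hypothetical.

```python
def e_cube_path(src, dst, n):
    """Nodes visited when differing address bits are fixed from dimension 0 up to n - 1."""
    path = [src]
    node = src
    for d in range(n):
        if ((node ^ dst) >> d) & 1:
            node ^= 1 << d
            path.append(node)
    return path


def path_links(path):
    """Directed links used by a path (assumed unit of conflict)."""
    return set(zip(path, path[1:]))


def greedy_passes(perm, n):
    """First-fit assignment of each source's path to a pass with no shared link.

    Returns (number_of_passes, pass_index_for_each_source).
    """
    busy_links = []   # busy_links[k] = links already reserved in pass k
    assignment = []
    for src, dst in enumerate(perm):
        needed = path_links(e_cube_path(src, dst, n))
        for k, busy in enumerate(busy_links):
            if not (busy & needed):
                busy |= needed
                assignment.append(k)
                break
        else:
            busy_links.append(set(needed))
            assignment.append(len(busy_links) - 1)
    return len(busy_links), assignment


print(e_cube_path(0b000, 0b101, 3))  # [0, 1, 5]
print(greedy_passes([5, 2, 7, 0, 1, 6, 3, 4], 3))
```

Finding the true minimum number of passes is the hard part that the abstract classifies as NP-complete on hypercubes; the greedy loop above sidesteps that by accepting a possibly larger schedule.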

Citations: 13

Hyper Petersen network: yet another hypercube-like topology

[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1992-10-19 DOI: 10.1109/FMPC.1992.234949

S.K. Das, A. Banerjee

The authors propose and analyze a new hypercube-like topology, called the hyper-Petersen (HP) network, which is constructed from the Cartesian product of a binary hypercube and the Petersen graph. The properties of the HP topology include regularity, a high degree of symmetry and connectivity, and a small diameter. For example, it is shown that an n-dimensional HP network with N = 1.25 × 2ⁿ nodes covers 2.5 times more nodes than the binary hypercube at the cost of increasing the degree by one. Furthermore, with the same degree and connectivity, the diameter of the HP network is one less than that of a hypercube, yet it has a 1.25 times higher packing density. The authors also discuss the embedding of various other topologies such as meshes, trees, and twisted hypercubes on the HP, thereby emphasizing its rich interconnection structure with a simple routing scheme for message communication. A ring of odd length can be embedded in an HP network, which is not possible in a binary hypercube.
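The figures quoted above can be checked on small instances with a short sketch. It assumes that the n-dimensional HP network is the Cartesian product of the (n-3)-dimensional binary hypercube with the Petersen graph; this is inferred from the node count 1.25 × 2ⁿ = 10 × 2ⁿ⁻³ and is not stated explicitly in the abstract. The function names are hypothetical, and the Petersen graph is built as the Kneser graph on 2-element subsets of a 5-element set.

```python
from collections import deque
from itertools import combinations


def petersen_graph():
    """Petersen graph: 2-subsets of {0..4}, adjacent exactly when disjoint."""
    nodes = [frozenset(c) for c in combinations(range(5), 2)]
    return {u: [v for v in nodes if not (u & v)] for u in nodes}


def hyper_petersen(n):
    """Adjacency lists of Q_(n-3) x Petersen (assumed definition of HP_n, n >= 3)."""
    if n < 3:
        raise ValueError("this sketch assumes n >= 3")
    petersen = petersen_graph()
    cube_dim = n - 3
    adjacency = {}
    for x in range(1 << cube_dim):
        for p in petersen:
            cube_moves = [(x ^ (1 << d), p) for d in range(cube_dim)]
            petersen_moves = [(x, q) for q in petersen[p]]
            adjacency[(x, p)] = cube_moves + petersen_moves
    return adjacency


def diameter(adjacency):
    """Largest shortest-path distance, via breadth-first search from every node."""
    worst = 0
    for start in adjacency:
        dist = {start: 0}
        queue = deque([start])
        while queue:
            u = queue.popleft()
            for v in adjacency[u]:
                if v not in dist:
                    dist[v] = dist[u] + 1
                    queue.append(v)
        worst = max(worst, max(dist.values()))
    return worst


hp5 = hyper_petersen(5)
print(len(hp5), {len(nbrs) for nbrs in hp5.values()}, diameter(hp5))  # 40 {5} 4
```

For n = 5 this gives 40 nodes of degree 5 with diameter 4, against 32 nodes and diameter 5 for the 5-dimensional binary hypercube of the same degree.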

Citations: 58

Low-cost flexible simulation with the static perfect shuffle network

[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1992-10-19 DOI: 10.1109/FMPC.1992.234883

K. Batcher

The static perfect shuffle (shuffle-exchange) network is based on H. S. Stone's (1971) perfect shuffle network; the two-by-two switches are removed and replaced by exchange links between processors. The network could be used in a low-cost flexible simulator for other networks such as multistage cube networks and hypercubes. The simulation of switched networks and of static networks is discussed.
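As a rough illustration of how shuffle and exchange links can stand in for a hypercube link, the sketch below treats the shuffle as a one-bit left rotation of the processor address and the exchange as a flip of the lowest address bit. This is the textbook view of a shuffle-exchange network and is offered as an assumption; it is not necessarily the simulation scheme described in the paper, and the function names are hypothetical.

```python
def shuffle(i, n_bits):
    """Perfect shuffle: rotate the n_bits-bit address i left by one position."""
    mask = (1 << n_bits) - 1
    return ((i << 1) | (i >> (n_bits - 1))) & mask


def exchange(i):
    """Exchange link: flip the least significant address bit."""
    return i ^ 1


def simulate_cube_link(i, d, n_bits):
    """Addresses visited while reaching i ^ (1 << d) using only shuffle/exchange links."""
    visited = [i]
    node = i
    # Rotate bit d into position 0.
    for _ in range((n_bits - d) % n_bits):
        node = shuffle(node, n_bits)
        visited.append(node)
    node = exchange(node)
    visited.append(node)
    # Rotate the remaining d positions so every other bit returns home.
    for _ in range(d):
        node = shuffle(node, n_bits)
        visited.append(node)
    return visited


print(shuffle(0b110, 3))                 # 5, i.e. 0b101
print(simulate_cube_link(0b0101, 2, 4))  # ends at 0b0001
```

In this sketch one hypercube hop costs at most n_bits shuffles plus one exchange, which hints at the cost side of the low-cost trade-off.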

Citations: 0

Reconfigurable mesh algorithms for summing up binary values and its applications

[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1992-10-19 DOI: 10.1109/FMPC.1992.234884

Y.-C. Chen, W.-T. Cheng

The authors present an asymptotically efficient parallel algorithm for summing up n binary values on reconfigurable meshes. They show that, given n binary values on a √n × √n reconfigurable mesh with each processor initially containing one value, the summation of the n binary values can be performed in O(log log n) time. Several applications of the algorithm are presented. It is shown that summing up n b-bit numbers can be performed in O(b log log n) time on a √n × √n reconfigurable mesh. Next, the histogram computation of an n × n image can be completed in O(L log log n) time on an n × n reconfigurable mesh, where L is the number of gray-level values. A parallel algorithm for computing the area and perimeter of image components in an n × n image is developed on an n × n reconfigurable mesh. The resulting time complexity is O(C log log n), where C is the number of image components. The implementation of enumeration sort on an n × √n × √n reconfigurable mesh is shown. The sorting algorithm requires O(log log n) time.
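The link between enumeration sort and binary summation can be seen in a plain sequential sketch: the rank of each element is the sum of n binary comparison outcomes. The code below only shows that reduction; it is not the reconfigurable-mesh implementation from the paper, and the function name and the index-based tie-breaking rule are assumptions for illustration.

```python
def enumeration_sort(values):
    """Sort by computing each element's rank as a sum of 0/1 comparison results."""
    n = len(values)
    output = [None] * n
    for i, x in enumerate(values):
        # One binary value per comparison; ties are broken by index so that
        # equal elements still receive distinct ranks.
        bits = [1 if (y < x or (y == x and j < i)) else 0
                for j, y in enumerate(values)]
        rank = sum(bits)  # the binary summation step
        output[rank] = x
    return output


print(enumeration_sort([3, 1, 2, 3, 0]))  # [0, 1, 2, 3, 3]
```

Each of the n sums is independent of the others, so a machine that can add n binary values quickly can compute all ranks at once given enough processors.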

Citations: 2

An algorithm for a class of direct and inverse scattering problems

[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1992-10-19 DOI: 10.1109/FMPC.1992.234953

R. Poovendran, J. Dorband

A novel, highly parallel algorithm for a class of direct and inverse scattering problems is proposed. It is shown that this algorithm reduces the noise propagation exhibited by the existing algorithms and produces error terms that are proportional to the square of the discrete step size. Unlike the conventional algorithms, this new formulation decouples the reflection kernel in a given layer. Because of this decoupling, the new formulation completely eliminates error propagation between any two points in the same layer. Numerical examples are presented to illustrate the proposed algorithm.

Citations: 1

Permutation routing in 2-stage recirculating delta networks

[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1992-10-19 DOI: 10.1109/FMPC.1992.234872

B. Alleyne, I. Scherson

The authors discuss an N-input/output recirculating network that can take advantage of compile-time knowledge of algorithm-dependent communications but still performs efficiently on data-dependent permutations. A deterministic routing algorithm and a randomized routing algorithm are given. Mapping to Clos networks is considered.

Citations: 2

Broadcasting in hypercubes with link/node failures

[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1992-10-19 DOI: 10.1109/FMPC.1992.234947

S. Park, B. Bose

The authors propose simple and optimal fault-tolerant broadcasting algorithms in the presence of at most n-1 link or node faults in an n-dimensional hypercube. Further results for up to 2n-3 faulty links or nodes are also considered. These algorithms are optimal or close to optimal in terms of the number of communication steps. The algorithms take n+1 time steps even in the presence of n-1 faulty links or nodes, and this can be achieved even with a single port. For up to 2n-3 link or node faults, the algorithms take at most n+3 steps, even with a single port.
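As a point of comparison for the n+1 and n+3 step counts above, the sketch below generates the standard fault-free broadcast on an n-dimensional hypercube, which completes in n steps with each informed node sending one message per step. It is only a baseline under the assumption of no faults; it does not implement the fault-tolerant algorithms of the paper, and the function name is hypothetical.

```python
def broadcast_schedule(n, source=0):
    """Fault-free single-port broadcast: in step d, every informed node
    forwards the message across dimension d.

    Returns (schedule, informed), where schedule[d] lists the (sender,
    receiver) pairs of step d and informed is the final set of nodes reached.
    """
    informed = {source}
    schedule = []
    for d in range(n):
        sends = [(u, u ^ (1 << d)) for u in sorted(informed)]
        informed |= {receiver for _, receiver in sends}
        schedule.append(sends)
    return schedule, informed


schedule, informed = broadcast_schedule(3, source=5)
for step, sends in enumerate(schedule):
    print(step, sends)
print(len(informed))  # 8
```

The number of informed nodes doubles in every step, so n steps is the minimum for 2ⁿ nodes under the single-port assumption; the paper's results show how little is lost when faults are present.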

Citations: 9

Selection on the reconfigurable mesh

[Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1992-10-19 DOI: 10.1109/FMPC.1992.234907

E. Hao, P. MacKenzie, Q. Stout

A Θ(log n) time algorithm to select the kth smallest element in a set of n elements on a reconfigurable mesh with n processors is obtained. This improves on the running time of the previous fastest algorithm by a factor of log n. It is also shown that variants of this problem can be solved even faster. Finally, a proof of an Ω(log log n) lower bound on the time for the rmesh selection problem is given.

Citations: 25
