Comparisons and analysis of massively parallel SIMD architectures for parallel logic simulation
Eunmi Choi, M. Chung, Yunmo Chung
Pub Date: 1992-03-01 | DOI: 10.1109/IPPS.1992.222986
This paper compares and analyzes massively parallel SIMD architectures as processing environments for parallel logic simulation. The CM-2 and the MP-1 are considered as target machines for the comparison. Detailed contrasts between the two parallel schemes are made based on actual simulation results and system performance. Distributed event-driven simulation protocols are used to obtain experimental results on the two massively parallel SIMD machines. According to the results, the MP-1 is 2 to 2.5 times faster than the CM-2 for benchmark circuits of up to 16K gates, while the CM-2 can accommodate circuits with a larger number of gates. The presented comparisons and analysis of the two machines can be used to choose a SIMD machine for efficient parallel logic simulation.
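The distributed protocols the paper benchmarks are machine-specific, but the event-driven simulation style they build on can be sketched sequentially. The following is a minimal illustrative sketch of event-driven gate-level simulation, not the paper's SIMD implementation; the gate encoding and function names are assumptions:

```python
import heapq

def simulate(gates, events, t_end):
    """Tiny event-driven gate-level simulator.

    gates:  name -> (fn, input_wires, delay, output_wire)
    events: list of (time, wire, value) stimuli
    Only gates whose inputs change are re-evaluated, which is the
    essence of event-driven (as opposed to oblivious) simulation.
    """
    wires = {}
    queue = list(events)
    heapq.heapify(queue)
    while queue:
        t, wire, val = heapq.heappop(queue)
        if t > t_end or wires.get(wire) == val:
            continue                      # suppress no-change events
        wires[wire] = val
        for fn, ins, delay, out in gates.values():
            if wire in ins:               # re-evaluate affected gates only
                new = fn(*(wires.get(w, 0) for w in ins))
                heapq.heappush(queue, (t + delay, out, new))
    return wires
```

A two-input AND gate driven high on both inputs settles to 1 after its unit delay.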
Efficient parallel algorithms for selection and searching on sorted matrices
R. Sarnath, Xin He
Pub Date: 1992-03-01 | DOI: 10.1109/IPPS.1992.223063
Parallel algorithms for more general versions of the well-known selection and searching problems are formulated. The authors look at these problems when the set of elements can be represented as an n*n matrix with sorted rows and columns. The selection algorithm takes O(log n log log n log* n) time with O(n/(log n log* n)) processors on an EREW PRAM. The searching algorithm takes O(log log n) time with O(n/log log n) processors on a CREW PRAM, which is optimal. The authors also show that no algorithm using at most n log^c n processors, c >= 1, can solve the matrix search problem in time faster than Omega(log log n).
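The parallel O(log log n) search is the paper's contribution; the structure it exploits is easiest to see in the classic sequential "staircase" search on such a matrix, sketched below (an illustrative O(n) baseline, not the authors' algorithm):

```python
def search_sorted_matrix(mat, target):
    """Search a matrix with sorted rows and sorted columns.

    Start at the top-right corner; each comparison discards either a
    full row or a full column, so at most 2n - 1 steps are needed.
    Returns a (row, col) position of target, or None if absent.
    """
    if not mat or not mat[0]:
        return None
    i, j = 0, len(mat[0]) - 1
    while i < len(mat) and j >= 0:
        if mat[i][j] == target:
            return (i, j)
        if mat[i][j] > target:
            j -= 1   # everything below in this column is even larger
        else:
            i += 1   # everything to the left in this row is even smaller
    return None
```

The parallel algorithm in the paper splits this staircase among processors to reach doubly-logarithmic time.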
Asymmetrical multiconnection three-stage Clos networks
A. Varma, S. Chalasani
Pub Date: 1992-03-01 | DOI: 10.1002/net.3230230423
The authors study routing problems in a general class of asymmetrical three-stage Clos networks. This class covers many asymmetrical three-stage networks considered by earlier researchers. They derive necessary and sufficient conditions under which this class of networks is rearrangeable with respect to a set of multiconnections, that is, connections where the paired entities are not limited to single terminals but can be arbitrary subsets of the terminals. They model the routing problem in these networks as a network-flow problem. If the number of switching elements in the first and last stages of the network is O(f) and the number of switching elements in the middle stage is m, then the network-flow model yields a routing algorithm with running time O(mf^3).
A scheme for state change in a distributed environment using weighted throw counting
K. Rokusawa, N. Ichiyoshi
Pub Date: 1992-03-01 | DOI: 10.1109/IPPS.1992.222992
This paper proposes a scheme for changing the execution state of a pool of processes in a distributed environment where there may be processes in transit. The scheme uses weighted throw counting to detect the completion of a state change, and can detect termination as well. It works whether the communication channels are synchronous or asynchronous, FIFO or non-FIFO. The message complexity of the scheme is typically O(number of processing elements).
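The core idea of weighted counting is that a controller hands out a total weight, every outstanding or in-transit process carries a share of it, and completion is detected exactly when all weight has been returned, independent of message ordering. A minimal sketch of that invariant (a hypothetical API, not the paper's implementation):

```python
class WeightedThrowCounter:
    """Weighted-counting detection of a distributed completion condition.

    Integer weights avoid rounding; throwing a process splits the
    sender's weight, and a finished process returns its weight to the
    controller. All weight back == no process running or in transit.
    """
    TOTAL = 1 << 30

    def __init__(self):
        self.returned = 0

    def spawn_root(self):
        return self.TOTAL            # the root process carries all weight

    def split(self, weight):
        half = weight // 2           # give part of the weight to the
        return half, weight - half   # thrown (migrating) process

    def finish(self, weight):
        self.returned += weight      # process ended: weight comes home

    def terminated(self):
        return self.returned == self.TOTAL
```

Because weights sum to the invariant total, the controller never declares completion while any weight is still out, even on non-FIFO channels.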
A conceptual framework for implementing neural networks on massively parallel machines
Magali E. Azema-Barac
Pub Date: 1992-03-01 | DOI: 10.1109/IPPS.1992.222973
This paper describes a framework for implementing neural networks on massively parallel machines. The framework is generic and applies to a range of neural networks (Multi Layer Perceptron, Competitive Learning, Self-Organising Map, etc.) as well as a range of massively parallel machines (Connection Machine, Distributed Array Processor, MasPar). It consists of two phases: an abstract decomposition of neural networks and a machine-specific decomposition. The abstract decomposition identifies the parallelism implemented by neural networks, and provides alternative distribution schemes according to the required exploitation of parallelism. The machine-specific decomposition considers the relevant machine criteria, and integrates these with the result of the abstract decomposition to form a 'decision' system. This system formalises the relative gain of each distribution scheme according to neural network and machine criteria. It then identifies their possible optimisations. Finally, it computes and ranks the absolute speed-up of each distribution scheme.
A functional execution model for a non-dataflow tagged token architecture
G. Jennings
Pub Date: 1992-03-01 | DOI: 10.1109/IPPS.1992.222978
The author proposes a new execution model for a non-dataflow tagged-token architecture which is not Petri-net based but rather more closely related to the lambda calculus. The model exploits a functional programming style having applicative-order evaluation. The computation's execution graph is dynamically generated according to easily understood dynamic tagging rules which have been demonstrated to be implementable. The model permits conceptually unbounded parallelism for an interesting class of list-oriented computations. The author explains the model with the help of a simple dot-product computation as an example. He highlights some of the major differences between the dataflow paradigm and his own. Architectural issues toward implementation are briefly discussed.
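The abstract's dot-product example turns on tag matching: operand tokens carry index tags and a node fires whenever two tokens with the same tag have both arrived, in any order. The sketch below illustrates generic tagged-token matching on a dot product; it is not Jennings' specific model, and the token encoding is an assumption:

```python
import random

def tagged_dot(xs, ys):
    """Dot product via tagged-token matching.

    Each element becomes a token carrying its index as a tag. The
    multiply 'node' fires when both operands for a tag are present;
    arrival order is irrelevant, so all products are conceptually
    available in parallel before the final sum reduction.
    """
    tokens = ([('x', i, v) for i, v in enumerate(xs)]
              + [('y', i, v) for i, v in enumerate(ys)])
    random.shuffle(tokens)           # simulate arbitrary arrival order
    waiting, products = {}, {}
    for _port, tag, val in tokens:
        if tag in waiting:           # partner already arrived: fire
            products[tag] = waiting.pop(tag) * val
        else:                        # first operand: wait for its match
            waiting[tag] = val
    return sum(products.values())
```

The result is deterministic despite the shuffled arrival order, which is the point of tagging.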
Optimal allocation of shared data over distributed memory hierarchies
E. Haddad
Pub Date: 1992-03-01 | DOI: 10.1109/IPPS.1992.222974
Nonreplicated shared data of distributed applications is optimally allocated to pre-specified multilevel memory partitions at the sites of a heterogeneous multicomputer network to minimize a weighted combination of systemwide mean time delay performance and mean communication cost per access request. Greedy and fast optimization algorithms are presented for nonqueueing lightly-loaded as well as heavily-loaded multiqueue system models with channel, I/O, and memory hierarchy queues. Extensions to data exhibiting nonuniform access demand rates and distinct query and update statistics are presented.
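In the nonqueueing, lightly-loaded model, placement decisions for nonreplicated items decouple, so a greedy per-item scan minimizes the weighted objective for that simplified setting. The sketch below is illustrative only; the weighting form and data layout are assumptions, not the paper's formulation:

```python
def allocate(demands, delay, cost, alpha=0.5):
    """Greedy placement of nonreplicated data items.

    demands: item -> access rate
    delay:   site -> mean access time delay at that site
    cost:    item -> {site -> mean communication cost per access}
    alpha:   weight trading delay against communication cost

    With no queueing, each item's contribution to the objective is
    independent, so picking the per-item minimum is globally optimal
    in this simplified model.
    """
    return {
        item: min(delay, key=lambda s: rate * (alpha * delay[s]
                                               + (1 - alpha) * cost[item][s]))
        for item, rate in demands.items()
    }
```

The heavily-loaded multiqueue models in the paper couple the decisions through queueing delays, which is where the more elaborate optimization algorithms come in.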
The bus-usage method for the analysis of reconfiguring networks algorithms
Y. Ben-Asher, A. Schuster
Pub Date: 1992-03-01 | DOI: 10.1109/IPPS.1992.223056
Reconfigurable networks have recently attracted increased attention as an extremely strong parallel model that is realizable in hardware. The authors consider the basic problem of gathering information that is dispersed among the nodes of the network. They analyze the complexity of the problem on reconfigurable linear arrays. The analysis introduces a novel criterion for the efficiency of reconfigurable network algorithms, namely bus-usage, which measures the utilization of the network sub-buses by the algorithm. It is shown how this yields bounds on the algorithm's run-time, by deriving a run-time to bus-usage trade-off.
A hierarchical directory scheme for large-scale cache-coherent multiprocessors
Y. Maa, D. Pradhan, D. Thiébaut
Pub Date: 1992-03-01 | DOI: 10.1109/IPPS.1992.223074
The cache coherence problem is a major design issue for shared-memory multiprocessors. As the system size scales, traditional bus-based snoopy cache coherence schemes are no longer adequate. Instead, the directory-based scheme is a promising approach to large-scale cache coherence. However, the storage overhead of directory schemes often becomes prohibitive as the system size increases. The paper proposes the hierarchical full-map directory to reduce the storage requirement while still achieving satisfactory performance. The key point is to exploit the inherent geographical interprocessor locality among shared data in parallel programs. Trace-driven evaluations show that the performance of the proposed scheme is competitive with the full-map directory scheme, while reducing the storage overhead by over 90%. The proposed hierarchical full-map directory scheme seems to be a promising hardware approach for handling cache coherence in the design of future large-scale multiprocessor memory systems.
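The storage saving comes from keeping full presence information only where sharers actually exist. A toy two-level illustration of that idea (not the paper's exact directory organization; the class and its fields are assumptions): the top level holds one bit per cluster, and a per-cluster bit vector is allocated only for clusters that share the block, so a block shared within one cluster costs far fewer bits than a flat full map.

```python
class HierarchicalDirectory:
    """Two-level presence-bit directory for one memory block.

    top   : one bit per cluster (is any processor in it a sharer?)
    maps  : per-cluster presence bit vectors, allocated on demand,
            exploiting geographical locality of sharing.
    """
    def __init__(self, clusters, per_cluster):
        self.clusters = clusters        # capacity of the top-level vector
        self.per_cluster = per_cluster  # processors per cluster
        self.top = 0
        self.maps = {}

    def add_sharer(self, pid):
        c, i = divmod(pid, self.per_cluster)
        self.top |= 1 << c                              # mark the cluster
        self.maps[c] = self.maps.get(c, 0) | (1 << i)   # mark the processor

    def sharers(self):
        return [c * self.per_cluster + i
                for c, bits in self.maps.items()
                for i in range(self.per_cluster) if bits >> i & 1]
```

When sharing stays within one cluster, only one second-level vector exists, which is the locality effect behind the reported storage reduction.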
Serial and parallel algorithms for the medial axis transform
J. Jenq, S. Sahni
Pub Date: 1992-03-01 | DOI: 10.1109/IPPS.1992.223025
The authors develop an O(n^2) time serial algorithm to obtain the medial axis transform (MAT) of an n*n image. An O(log n) time CREW PRAM algorithm and an O(log^2 n) time SIMD hypercube parallel algorithm for the MAT are also developed. Both of these use O(n^2) processors. Two problems associated with the MAT are also studied. These are the area and perimeter reporting problems. The authors develop an O(log n) time hypercube algorithm for both of these problems. Here n is the number of squares in the MAT and the algorithms use O(n^2) processors.
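The abstract does not spell out the serial algorithm; one standard way to compute in O(n^2) time the square sizes that underlie a square MAT is a single dynamic-programming sweep, sketched below (an illustrative sketch, not necessarily the authors' algorithm):

```python
def mat_squares(img):
    """For each pixel, side length of the largest all-ones square
    whose top-left corner is that pixel.

    side[i][j] = 1 + min of the three neighboring subproblems to the
    right, below, and diagonally below-right; a padded border of zeros
    handles the image boundary. One pass over the image: O(n^2).
    The MAT itself consists of the maximal such squares.
    """
    n, m = len(img), len(img[0])
    side = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(m - 1, -1, -1):
            if img[i][j]:
                side[i][j] = 1 + min(side[i + 1][j],
                                     side[i][j + 1],
                                     side[i + 1][j + 1])
    return side
```

On a 2*2 block of ones the top-left pixel gets side 2 while the other block pixels get side 1, so only the side-2 square survives as a maximal (MAT) square.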