Proceedings Sixth International Parallel Processing Symposium最新文献

英文中文

An optimal sorting algorithm on reconfigurable mesh 可重构网格的最优排序算法

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223059

Ju-wook Jang, V. Prasanna

An optimal sorting algorithm on the reconfigurable mesh is proposed. The algorithm sorts n numbers in constant time using n*n processors. The best known previous result uses O(n*nlog/sup 2/n) processors. The presented algorithm satisfies the AT/sup 2/ lower bound of Omega (n/sup 2/) for sorting n numbers in the word model of VLSI. Modification to the algorithm for area-time trade-off is shown, to achieve AT/sup 2/ optimality over 1>

提出了一种可重构网格的最优排序算法。该算法使用n*n个处理器在常数时间内对n个数字进行排序。之前最著名的结果使用了O(n*nlog/sup 2/n)个处理器。该算法满足超大规模集成电路字模型中n个数排序的AT/sup 2/ ω (n/sup 2/)下界。显示了对区域-时间权衡算法的修改，以实现AT/sup 2/优于1>的最优性。

引用次数: 139

IDPS: a massively parallel heuristic search algorithm IDPS:一个大规模并行启发式搜索算法

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223042

A. Mahanti, C. J. Daniels

Presents an efficient SIMD parallel algorithm, called IDPS (iterative deepening parallel search). The performance of four variants of IDPS is studied through experiments conducted on the well known test-bed problem for search algorithms, the 15-puzzle. During the experiments, data were gathered under two different static load-balancing schemes. Under the first scheme, an average efficiency of approximately /sup 3///sub 4/ was obtained for 4 K, 8 K, and 16 K processors. Under the second scheme, average efficiencies of 0.92 and 0.76 were obtained for 8 K and 16 K processors, respectively. It is also shown that for admissible search, linear or superlinear average speedup can be obtained for problems of significant size.<>

提出了一种高效的SIMD并行算法IDPS(迭代深化并行搜索)。通过在著名的搜索算法测试台问题15-puzzle上进行的实验，研究了四种IDPS变体的性能。在实验中，采用两种不同的静态负载均衡方案收集数据。在第一种方案下，4 K、8 K和16 K处理器的平均效率约为/sup 3/// //sub 4/。在第二种方案下，8k和16k处理器的平均效率分别为0.92和0.76。对于可容许搜索，对于显著大小的问题，可以得到线性或超线性的平均加速

引用次数: 3

A paradigm for distributed deadlock avoidance in multicomputer networks 多计算机网络中分布式死锁避免的范例

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222993

J. P. Samantarai

A paradigm for avoiding buffer deadlock in point-to-point multicomputer networks is presented which is ideal for today's high connectivity, load sharing networks. Unlike the traditional resource ordering principle, this paradigm not only allows unrestricted routing but uses the existence of multiple paths to its direct advantage. Deadlock is avoided entirely using exchange buffers which are not used for message queues, thus eliminating queueing overhead. The paradigm is topology-independent, imposes no routing restrictions, and uses states of neighboring links only, so that it can be built into link level protocol, providing unrestricted deadlock-free routing, while operating transparent to any fault-tolerant topology-specific routing algorithm.<>

提出了一种在点对点多计算机网络中避免缓冲区死锁的范例，该范例非常适合当今的高连接、负载共享网络。与传统的资源排序原则不同，这种范式不仅允许不受限制的路由，而且利用多条路径的存在来发挥其直接优势。使用不用于消息队列的交换缓冲区完全避免了死锁，从而消除了队列开销。该范式是拓扑独立的，不施加路由限制，并且只使用相邻链路的状态，因此它可以被构建到链路级协议中，提供不受限制的无死锁路由，同时对任何容错拓扑特定的路由算法透明。

引用次数: 0

The interplay between granularity, performance and availability in a replicated Linda tuple space 复制Linda元组空间中粒度、性能和可用性之间的相互作用

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222976

S. Kambhatla, J. Walpole

Replication is a common method for increasing the availability of data in a distributed environment. The authors' interest is in the application of replication techniques in the domain of parallel processing. They explore the issues concerning degree of replication and granularity in the context of a distributed and highly available Linda tuple space. In particular, they study the performance effects of varying the number of replicas and the granularities of replication and concurrency control. Traditionally, when using replication in databases, the granularity of replication and that of concurrency control have been the same (at the file level (D.K. Gifford, 1979), for example). This is not an inherent requirement however. The authors show by detailed simulation of a replicated Linda tuple space that it is useful to separate the two granularities and that it is an important design issue especially in parallel processing systems.<>

复制是在分布式环境中提高数据可用性的常用方法。作者的兴趣是在并行处理领域复制技术的应用。他们探讨了在分布式和高可用的Linda元组空间上下文中有关复制程度和粒度的问题。特别是，他们研究了改变副本数量、复制粒度和并发控制的性能影响。传统上，当在数据库中使用复制时，复制的粒度和并发控制的粒度是相同的(例如，在文件级别(D.K. Gifford, 1979))。然而，这并不是一个固有的要求。作者通过对一个复制的Linda元组空间的详细模拟表明，分离两个粒度是有用的，这是一个重要的设计问题，特别是在并行处理系统中

引用次数: 1

Multi-packet selection on mesh-connected processor arrays 网格连接处理器阵列的多包选择

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.222999

D. Krizanc, L. Narayanan

The authors show efficient, deterministic algorithms for selection on the mesh-connected processor array, in the case when there are several elements at every processor. In particular, on a p-processor mesh, with N>or=p elements, stored N/p at every processor, they show that selection can be performed in O(min(plog/sup N///sub p/, max(N/p/sup 2/3/, square root p))) communication steps. The best previously known results were based on sorting and required O(N/ square root p) communication steps, for N>or=p.<>

在每个处理器上都有多个元素的情况下，作者展示了在网格连接的处理器阵列上进行选择的高效、确定性算法。特别是，在p-处理器网格上，具有N>或=p个元素，在每个处理器上存储N/p，它们表明选择可以在O(min(plog/sup N///sub p/， max(N/p/sup 2/3/，平方根p))通信步骤中执行。先前已知的最佳结果是基于排序和需要O(N/平方根p)个通信步骤，对于N>或=p。

引用次数: 9

Exploiting concurrency among tasks in partitionable parallel processing systems 利用可分区并行处理系统中任务之间的并发性

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223076

W. Nation, A. A. Maciejewski, H. Siegel

One benefit of partitionable parallel processing systems is their ability to execute multiple, independent tasks simultaneously. Previous work has identified conditions such that, when there are k tasks to be processed, partitioning the system such that all k tasks are processed simultaneously results in a minimum overall execution time. An alternate condition is developed that provides additional insight into the effects of parallelism on execution time. This result, and previous results, however, assume that execution times are data independent. It is shown that data-dependent tasks do not necessarily execute faster when processed simultaneously even if the condition is met. A model is developed that provides for the possible variability of a task's execution time and is used in a new framework to study the problem of finding an optimal mapping for identical, independent data-dependent execution time tasks onto partitionable systems. Extension of this framework to situations where the k tasks are non-identical is discussed.<>

可分区并行处理系统的一个优点是能够同时执行多个独立任务。以前的工作已经确定了这样的条件:当有k个任务需要处理时，对系统进行分区，以便同时处理所有k个任务，从而使总体执行时间最短。开发了另一种条件，可以进一步了解并行性对执行时间的影响。然而，这个结果和前面的结果都假定执行时间与数据无关。结果表明，即使满足条件，数据相关任务在同时处理时也不一定执行得更快。开发了一个模型，该模型提供了任务执行时间的可能可变性，并在一个新的框架中用于研究为相同的、独立的、依赖数据的执行时间任务寻找到可分区系统的最佳映射问题。讨论了将该框架扩展到k个任务不相同的情况。

引用次数: 1

Vector Hartley transform employing multiprocessors 矢量哈特利变换采用多处理器

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223038

R. Mahapatra, Akhilesh Kumar

Many parallel implementations for signal processing transforms have already been reported. The implementation of Hou's FHT algorithm (1987) has been studied on three multiprocessor architectures (MPAs): multiprocessors connected through a shared bus; multiprocessors connected by an indirect binary n-cube multistage interconnection network and mesh connected multiprocessors. The article analyzes the performance of a vector Hartley transform algorithm on these MPAs.<>

许多信号处理变换的并行实现已经被报道。Hou的FHT算法(1987)的实现已经在三种多处理器架构(MPAs)上进行了研究:通过共享总线连接的多处理器;由间接二进制n-立方体多级互连网络连接的多处理器和网状连接的多处理器。本文分析了矢量哈特利变换算法在这些MPAs上的性能

引用次数: 1

Conflict-free scheduling of nested loop algorithms on lower dimensional processor arrays 低维处理器阵列上嵌套循环算法的无冲突调度

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223054

Zhenhui Yang, Weijia Shang, J. Fortes

In practice, it is interesting to map n-dimensional algorithms, or algorithms with n nested loops, onto (k-1)-dimensional arrays where k>

在实践中，将n维算法或具有n个嵌套循环的算法映射到k>的(k-1)维数组上是很有趣的。

引用次数: 9

Memory requirements to balance thus asymptotically full-speedup FFT computation on processor arrays 内存需求平衡，因此渐近加速FFT计算在处理器阵列上

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223045

J. Shieh

The paper proves that for a linearly-connected array of alpha processors or a mesh-connected array of alpha /sup 2/ processors, where each processor has computation bandwidth C, I/O bandwidth I and C/I=logm, Omega (m/sup alpha /) memory size is required in each processor to minimize the I/O requirement in balancing the FFT computation. Then it presents balanced FFT algorithms on these arrays to meet their memory size lower bounds. These algorithms are time optimal exhibiting full speedups.<>

本文证明了对于alpha处理器线性连接阵列或alpha /sup 2/处理器网格连接阵列，其中每个处理器的计算带宽为C, I/O带宽为I，且C/I=logm，在平衡FFT计算时，每个处理器需要Omega (m/sup alpha /)内存大小以最小化I/O需求。然后在这些数组上提出平衡的FFT算法以满足它们的内存大小下界。这些算法是时间最优的，表现出完全的加速。

引用次数: 1

Bussed interconnection networks from trees 从树的总线互连网络

Proceedings Sixth International Parallel Processing Symposium

Pub Date : 1992-03-01 DOI: 10.1109/IPPS.1992.223015

C. M. Fiduccia

Pin limitations are a fundamental obstacle in the construction of massively parallel computers. The paper introduces a class of d-dimensional bussed hypercubes that can perform simultaneous bidirectional communication across any dimension using d+1, rather than 2d, ports per node. Each network Q/sub d/(T) is based on a tree T, which specifies the 'shape' of the busses, and can perform d(d+1)/2 permutations pi /sub ij/(x)=x(+)c/sub ij/ via a simple global command. This construction is then generalized to any d permutations II=( pi /sub 1/,. . ., pi /sub d/) of any set of nodes X. Given any edge-labeled directed tree T, whose kth arc is associated with the permutation pi /sub k/, a bussed network N(II,T) is constructed that can-in one clock tick-perform any of the O(d/sup 2/) permutations arising from the paths in the tree T.<>

引脚限制是构建大规模并行计算机的一个根本障碍。本文介绍了一类d维总线超立方体，它可以通过每个节点使用d+1而不是2d端口在任何维度上执行同时双向通信。每个网络Q/sub d/(T)都基于一个树T，它指定了总线的“形状”，并且可以通过一个简单的全局命令执行d(d+1)/2个排列pi /sub ij/(x)=x(+)c/sub ij/。然后将这种构造推广到任何节点集合x的任何d个排列II=(pi /sub 1/，…，pi /sub d/)。给定任何边标记有向树T，其第k个弧与排列pi /sub k/相关联，构造一个总线网络N(II,T)，它可以在一个时钟内执行从树T的路径中产生的O(d/sup 2/)个排列中的任何一个。

引用次数: 0

首页上一页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings Sixth International Parallel Processing Symposium

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀