[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation最新文献

英文中文

Solution to an architectural problem in parallel computing 并行计算中一个体系结构问题的解决方案

[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1990-10-08 DOI: 10.1109/FMPC.1990.89494

D. Lee

The author presents a solution to the previously unsolved problem of how to construct an array processor with N processing elements, N memory modules, and an interconnection network that allows parallel access and alignment of rows, columns, diagonals, contiguous blocks, and distributed blocks of N*N arrays. The solution leads to an array processor that is both simple and efficient in two critical respects: the memory system uses the minimum number of memory modules to achieve conflict-free memory access and is able to compute N addresses with O(log N) logic gates in O(1) time. The interconnection network is multistage with O(N log N) logic gates, and it can align any of these data vectors for store/fetch, as well as for subsequent processing with a single pass through the network.<>

作者提出了一种以前未解决的问题，即如何构建一个具有N*N阵列的N个处理单元、N个存储模块和一个允许并行访问和对齐N*N阵列的行、列、对角线、连续块和分布式块的互连网络的阵列处理器。该解决方案使阵列处理器在两个关键方面既简单又高效:存储器系统使用最少数量的存储器模块来实现无冲突的存储器访问，并且能够在O(1)时间内用O(log N)个逻辑门计算N个地址。互连网络是具有O(N log N)逻辑门的多级网络，它可以对齐任何这些数据向量进行存储/提取，以及通过网络的单次传递进行后续处理。

引用次数: 3

Porting an iterative parallel region growing algorithm from the MPP to the MasPar MP-1 将迭代并行区域增长算法从MPP移植到MasPar MP-1

[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1990-10-08 DOI: 10.1109/FMPC.1990.89456

J. Tilton

An iterative parallel region growing (IPRG) algorithm, developed and implemented on the massively parallel processor (MPP) at NASA Goddard, is described. The experience of porting the IPRG algorithm from the MPP to the MasPar MP-1 is related. Porting was very easy and straightforward, especially when the Dorband virtualization software was used. The porting discussed, consisting of 1879 lines of MPL code, was accomplished in just two weeks by the author. The major difference between the two implementations is that the looping over virtual parallel arrays had to be done explicitly and had to be the outermost loop (for efficiency) in the MPP Pascal implementation, whereas the same looping was done implicitly in the MPL implementation and could be done in the innermost loop. In a performance test on a 256*256 pixel section of a seven-band Landsat thematic mapper image data set, the smaller MasPar MP-1 computer had roughly the same or better performance as the MPP. In the initial iterations, when the regions were still very small, the MPP was about 25% faster than the MasPar MP-1. By iteration 14, the MasPar MP-1 was 33% faster than the MPP, and for ensuing iterations indications are that the MasPar MP-1 speedup versus the MPP will be even larger.<>

介绍了在NASA戈达德大型并行处理器(MPP)上开发和实现的迭代并行区域生长(IPRG)算法。介绍了将IPRG算法从MPP移植到MasPar MP-1的经验。移植非常简单和直接，特别是当使用Dorband虚拟化软件时。所讨论的移植由1879行MPL代码组成，作者仅在两周内就完成了。这两种实现之间的主要区别在于，在MPP Pascal实现中，虚拟并行数组的循环必须显式地完成，并且必须是最外层的循环(为了效率)，而在MPL实现中，相同的循环是隐式地完成的，并且可以在最内层的循环中完成。在对七波段Landsat主题绘图仪图像数据集的256*256像素部分进行的性能测试中，较小的MasPar MP-1计算机具有与MPP大致相同或更好的性能。在最初的迭代中，当区域仍然很小时，MPP比MasPar MP-1快25%左右。在第14次迭代中，MasPar MP-1比MPP快33%，并且在随后的迭代中，MasPar MP-1比MPP的加速速度将会更大。

{"title":"Porting an iterative parallel region growing algorithm from the MPP to the MasPar MP-1","authors":"J. Tilton","doi":"10.1109/FMPC.1990.89456","DOIUrl":"https://doi.org/10.1109/FMPC.1990.89456","url":null,"abstract":"An iterative parallel region growing (IPRG) algorithm, developed and implemented on the massively parallel processor (MPP) at NASA Goddard, is described. The experience of porting the IPRG algorithm from the MPP to the MasPar MP-1 is related. Porting was very easy and straightforward, especially when the Dorband virtualization software was used. The porting discussed, consisting of 1879 lines of MPL code, was accomplished in just two weeks by the author. The major difference between the two implementations is that the looping over virtual parallel arrays had to be done explicitly and had to be the outermost loop (for efficiency) in the MPP Pascal implementation, whereas the same looping was done implicitly in the MPL implementation and could be done in the innermost loop. In a performance test on a 256*256 pixel section of a seven-band Landsat thematic mapper image data set, the smaller MasPar MP-1 computer had roughly the same or better performance as the MPP. In the initial iterations, when the regions were still very small, the MPP was about 25% faster than the MasPar MP-1. By iteration 14, the MasPar MP-1 was 33% faster than the MPP, and for ensuing iterations indications are that the MasPar MP-1 speedup versus the MPP will be even larger.<<ETX>>","PeriodicalId":193332,"journal":{"name":"[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129255601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

A new computational model for massive parallelism 一种新的大规模并行计算模型

[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1990-10-08 DOI: 10.1109/FMPC.1990.89466

S. Berkovich

A universal parallel computational model using the concept of distributed associative processing (DASP) is presented. The effective realization of this model is due to unique facilities of a multiaccess content-induced transaction overlap communication technique. The operative mechanism of the model uses a plurality of processing nodes with content-addressable transmission buffers. This construction can be incrementally expanded through the hierarchical multiplexing of information pathways. The developed principles simplify hardware/software organization and ensure performance gains for a variety of computational schemes.<>

提出了一种基于分布式关联处理(DASP)的通用并行计算模型。该模型的有效实现得益于多址内容诱导的事务重叠通信技术的独特功能。该模型的操作机制使用具有内容可寻址传输缓冲器的多个处理节点。这种结构可以通过信息路径的分层多路复用逐步扩展。所开发的原理简化了硬件/软件组织，并确保了各种计算方案的性能提高。

引用次数: 1

Multiple channel architecture 多通道架构

[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1990-10-08 DOI: 10.1109/FMPC.1990.89477

T. S. Wailes, D. Meyer

A parallel processing architecture based on multiple-channel optical communication is described. A large number of independent, selectable channels (or virtual buses) are available using a single optical fiber. Arbitrary interconnection patterns, as well as machine partitions, can be emulated by using appropriate channel assignments. Hierarchies of parallel architectures and simultaneous execution of parallel tasks are also possible. Recent technological advances in semiconductor laser technology that make such a parallel architecture possible, a basic system overview, various channel allocation strategies, and a summary of advantages compared with traditional interconnection techniques are presented.<>

介绍了一种基于多通道光通信的并行处理体系结构。使用一根光纤可以提供大量独立的、可选择的通道(或虚拟总线)。可以通过使用适当的通道分配来模拟任意的互连模式以及机器分区。并行架构的层次结构和并行任务的同时执行也是可能的。半导体激光技术的最新技术进步使这种并行架构成为可能，介绍了基本系统概述，各种信道分配策略，并总结了与传统互连技术相比的优势。

引用次数: 3

An interconnection network and a routing scheme for a massively parallel message-passing multicomputer 一种用于大规模并行消息传递的多计算机的互连网络和路由方案

[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1990-10-08 DOI: 10.1109/FMPC.1990.89484

C. Germain, Jean-Luc Béchennec, D. Etiemble, J. Sansonnet

The communication system of a massively parallel architecture called MEGA is presented. The implications of massive parallelism for routing strategies and communication models are discussed. A routing strategy, called forced routing, is proposed. It minimizes contention by making full use of all the shortest paths in the network. Its performance has been studied by simulation, and the results are presented. The mixed communication model used allows processes with mutual reference to have direct information exchanges. The routing strategy can be implemented within a restricted chip area in CMOS technology.<>

提出了一种大规模并行架构的通信系统MEGA。讨论了大规模并行对路由策略和通信模型的影响。提出了一种称为强制路由的路由策略。它通过充分利用网络中的所有最短路径来最小化争用。对其性能进行了仿真研究，并给出了仿真结果。所使用的混合通信模型允许具有相互引用的流程进行直接的信息交换。在CMOS技术中，该路由策略可以在有限的芯片区域内实现。

引用次数: 9

Performance prediction-How good is good? 业绩预测——多好才算好?

[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1990-10-08 DOI: 10.1109/FMPC.1990.89435

B. Stramm, Francine Berman

The prediction of performance of parallel algorithms mapped to parallel computers is addressed. Performance prediction models are parameterized by the algorithms, mapping, and target machine. A mechanism for comparing the accuracy of the models that establishes a partial order between them is proposed.<>

研究了映射到并行计算机上的并行算法的性能预测问题。性能预测模型由算法、映射和目标机参数化。提出了一种比较模型精度的机制，即在它们之间建立偏序。

引用次数: 5

High performance mapping for massively parallel hierarchical structures 大规模并行分层结构的高性能映射

[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1990-10-08 DOI: 10.1109/FMPC.1990.89467

Sotirios G. Ziavras

Techniques for mapping image processing and computer vision algorithms onto a class of hierarchically structured systems are presented. In order to produce mappings of maximum efficiency, objective functions that measure the quality of given mappings with respect to particular optimization goals are proposed. The effectiveness and the computation complexity of mapping algorithms that yield very high performance by minimizing the objective functions are discussed. Performance results are also presented.<>

介绍了将图像处理和计算机视觉算法映射到一类层次结构系统的技术。为了产生效率最高的映射，提出了相对于特定优化目标衡量给定映射质量的目标函数。讨论了通过最小化目标函数而产生高性能的映射算法的有效性和计算复杂度。并给出了性能结果

引用次数: 0

The StarLite project StarLite项目

[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1990-10-08 DOI: 10.1109/FMPC.1990.89501

E. llEEllEEliI, James G. Smith

The components and design of the StarLite programming environment for Modula-2 and several projects that are being developed using StarLite are discussed. StarLite emulates a 1000-node, shared virtual memory multiprocessor on a single hardware processor. The parallel programming package is discussed, and some of the operating system design problems for a massively parallel computer are reviewed.<>

讨论了Modula-2的StarLite编程环境的组成和设计，以及几个正在使用StarLite开发的项目。StarLite在单个硬件处理器上模拟1000个节点的共享虚拟内存多处理器。讨论了并行编程包，回顾了大规模并行计算机操作系统设计中的一些问题

引用次数: 5

An asynchronous multiprocessor design for branch-and-bound algorithms 分支绑定算法的异步多处理器设计

[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1990-10-08 DOI: 10.1109/FMPC.1990.89440

Kam-Hoi Cheng, Q. Wang

A fast asynchronous multiprocessor system designed to implement branch-and-bound algorithms is described. Cooperating processors are only responsible for performing computation essential to the problem. Dynamic sharing of work and coordinate among processors are provided by several servers, all of which are capable of handling multiple accesses simultaneously. It is shown how to coordinate the use of these designs and prove the correctness of the authors' solution in reactivating idle processors and detecting the termination of the computation. The loss of computation power due to the uneven work-load distribution, coordination, and synchronization of processors has been reduced significantly compared with other hardware designs.<>

介绍了一种实现分支绑定算法的快速异步多处理器系统。协作处理器只负责执行对问题至关重要的计算。处理器之间的动态工作共享和协调由多个服务器提供，所有这些服务器都能够同时处理多个访问。说明了如何协调这些设计的使用，并证明了作者的解决方案在重新激活空闲处理器和检测计算终止方面的正确性。与其他硬件设计相比，由于处理器的工作负载分布、协调和同步不均匀而导致的计算能力损失已经大大减少

引用次数: 6

Comparative performance evaluation of a new SIMD machine 一种新型SIMD机器的性能比较评价

[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation

Pub Date : 1990-10-08 DOI: 10.1109/FMPC.1990.89468

J. M. Jennings, Edward W. Davis, R. A. Heaton

The performance of BLITZEN, a new massively parallel machine, is compared with that of the Massively Parallel Processor (MPP) for two image-processing functions: rotation and resampling. These functions, as implemented on the MPP, were modified to exploit new architectural features of BLITZEN. The functional simulator of BLITZEN, used for algorithm development and timing information, is described. A performance comparison based on instruction cycle counts shows a significant speedup for the new machine due to architectural features that improve data movement capability.<>

在旋转和重采样两项图像处理功能上，对比了新型大规模并行处理器BLITZEN与大规模并行处理器MPP的性能。这些在MPP上实现的功能被修改，以利用BLITZEN的新架构特征。介绍了用于算法开发和时序信息获取的BLITZEN功能模拟器。基于指令周期计数的性能比较显示，由于改进了数据移动能力的架构特性，新机器有了显著的加速

引用次数: 7

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

[1990 Proceedings] The Third Symposium on the Frontiers of Massively Parallel Computation

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀