Implementation of JAC3D on The NCUBE/ten
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.555431
C. Vaughan
An implementation is presented for JAC3D on a massively parallel hypercube computer. JAC3D, a three-dimensional finite element code developed at Sandia, uses several hundred hours of Cray time each year in solving structural analysis problems. Two major areas of investigation are discussed: (1) the development of general methods, data structures, and routines to communicate information between processors, and (2) the implementation and evaluation of four algorithms to map problems onto the node processors of the hypercube in a load-balanced fashion. The performance of JAC3D on the NCUBE/ten is compared with that on a Cray X-MP: the NCUBE/ten version presently takes 20% more compute time than the Cray. On a larger simulation that used more of the NCUBE's memory, the NCUBE/ten would take less compute time than the Cray. Current activity on the newer NCUBE 2 hypercube, which should lead to an order-of-magnitude improvement in run-time performance for the massively parallel solution of structural analysis problems, is summarized.
{"title":"Implementation of JAC3D on The NCUBE/ten","authors":"C. Vaughan","doi":"10.1109/DMCC.1990.555431","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555431","url":null,"abstract":"An implementation is presented for JAC3D on a massively parallel hypercube computer. JACSD, a three dimensional finite element code developed at Sandia, uses several hundred hours of Cray time each year in solving structural analysis problems. Two major areas of investigation are discussed: (1) the development of general methods, data structures, and routines to communicate information between processors, and (2) the implementation and evaluation of four algorithms to map problems onto the node processors of the hypercube in a loadbalanced fashion. The performance of JACJD on the NCUBE/ten is compared with that on a Cray X-MP: the NCUBE/ten version presently takes 20% more compute time than the Cray. On a larger simulation which used more of the NCUBE's memory, the NCUBE/ten would take less compute time than the Cray. Current activity on the newer NCUBE 2 hypercube is summarized which should lead to an order of magnitude improvement in run-time performance for the massively parallel solution of structural analysis problems.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124561446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Visualization: An Aid to Design and Understand Neural Networks in a Parallel Environment
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.556338
S.N. Gupta, M. Zubair, C. Grosch
In this paper, we describe two visualization tools developed on the DAP-510, a SIMD machine with 1024 processors. The two tools are (i) the interactive visualization tool and (ii) the display tool. The interactive visualization tool allows the user to steer the course of a computation by interactively modifying its parameters based on visual feedback. The display tool transforms numeric data into a visual form. It also gives the user the capability to manipulate the visual representation. In the implementation of these tools we exploit the parallel features of the DAP-510. These tools are used for designing and understanding neural networks. However, it is worth mentioning that they are general in nature and can easily interact with other parallel computation processes.
{"title":"Visualization: An Aid to Design and Understand Neural Networks in a Parallel Environment","authors":"S.N. Gupta, M. Zubair, C. Grosch","doi":"10.1109/DMCC.1990.556338","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556338","url":null,"abstract":"In this paper, we describe two visualization tools developed on the DAP-510, a SIMD machine having 1024 processors. The two tools are (i) the interactive visualization tool, and (ii) the display tool. The interactive visualization tool allows the user to steer the course of computation by interactively modifying its parameters based on the visual feedback. The display tool transforms the numeric data into a visual form. It also gives the user capability to manipulate the visual representation. In the implementation of these tools we exploit the parallel features of DAP-510. These tools are utilized for designing and understanding the neural networks. However, it is worth mentioning that these tools are general in nature and can easily interact with other parallel computa, tion processes.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"62 25","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120943586","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Parallel Sorting on Symult 2010
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.555387
P.P. Li, Y. Tung
In this paper, three sorting algorithms, Bitonic sort, Shell sort, and parallel Quicksort, are studied. We analyze the performance of these algorithms and compare them with the empirical results obtained from implementations on the Symult Series 2010, a distributed-memory, message-passing MIMD machine. Each sorting algorithm is a combination of a parallel sort component and a sequential sort component. These algorithms are designed for sorting M random integers on an N-processor machine, where M > N. We found that Bitonic sort is the best parallel sorting algorithm for small problem sizes, M/N < 64, and parallel Quicksort is the best for large problem sizes. The new parallel Quicksort algorithm with a simple key-selection method achieves a decent speed-up compared with other versions of parallel Quicksort on similar parallel machines. Although Shell sort has a worse theoretical time complexity, it does achieve linear speedup for large problem sizes by using a synchronization step to detect early termination of the sorting steps.
{"title":"Parallel Sorting on Symult 2010","authors":"P.P. Li, Y. Tung","doi":"10.1109/DMCC.1990.555387","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555387","url":null,"abstract":"In this paper, three sorting algorithms, Bitonic sort, Shell sort and parallel Quicksort are studied. We analyze the performance of these algorithms and compare them with the empirical results obtained from the implementations on the Symult Series 2010, a distributed-memory, message-passing MIMD machine. Each sorting algorithm is a combination of a parallel sort component and a sequential sort component. These algorithms are designed for sorting M elements of random integers on a N-processor machine, where M > N . We found that Bitonic sort is the best parallel sorting algorithm for small problem size, ( M / N ) < 64, and the parallel Quicksort is the best for large problem size. The new Parallel Quicksort algorithm with a simple key selection method achieves a decent speed-up comparing with other versions of parallel Quicksort on similar parallel machines. Although Shell sort has a worse theoretical time complexity, it does achieve linear speedup for large problem size by using a synchronization step to detect early termination of the sorting steps.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"131 10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124251863","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The Finite Element Solution of Two-Dimensional Transverse Magnetic Scattering Problems on the Connection Machine
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.555413
S. Hutchinson, S. Castillo, E. Hensel, K. Dalton
A study is conducted of the finite element solution of the partial differential equations governing two-dimensional electromagnetic field scattering problems on a SIMD computer. A nodal assembly technique is introduced which maps a single node to a single processor. The physical domain is first discretized in parallel to yield the node locations of an O-grid mesh. Next, the system of equations is assembled and then solved in parallel using a conjugate gradient algorithm for complex-valued, non-symmetric, non-positive-definite systems. Using this technique and Thinking Machines Corporation's Connection Machine-2 (CM-2), problems with more than 250k nodes are solved. Results of electromagnetic scattering, governed by the 2-D scalar Helmholtz wave equation, are presented for a variety of infinite cylinders and airfoil cross-sections. Solutions are demonstrated for a wide range of objects. A summary of performance data is given for the set of test problems.
{"title":"The Finite Element Solution of Two-Dimensional Transverse Magnetic Scattering Problems on the Connection Machine","authors":"S. Hutchinson, S. Castillo, E. Hensel, K. Dalton","doi":"10.1109/DMCC.1990.555413","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555413","url":null,"abstract":"A study is conducted of the finite element solution of the partial differential equations governing twodimensional electromagnetic field scattering problems on a SIMD computer. A nodal assembly technique is introduced which maps a single node to a single processor. The physical domain is first discretized in parallel to yield the node locations of an 0-grid mesh. Next, the system of equations is assembled and then solved in parallel using a conjugate gradient algorithm for complexvalued, non-symmetric, non-positive definite systems. Using this technique and Thinking Machines Corporation’s Connection Machine-2 (CM-2) , problems with more than 250k nodes are solved. Results of electromagnetic scattering, governed by the 2-d scalar Helmholtz wave equations are presented for a variety of infinite cylinders and airfoil crosssections. Solutions are demonstrated for a wide range of objects. A summary of performance data is given for the set of test problems.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"313 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115806027","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Parallel Methods for Solving Polynomial Problems on Distributed Memory Multicomputers
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.555390
Xiaodong Zhang, Hao Lu
We give a group of parallel methods for solving polynomial-related problems and their implementations on a distributed memory multicomputer. These problems are (1) the evaluation of polynomials, (2) the multiplication of polynomials, (3) the division of polynomials, and (4) the interpolation of polynomials. Mathematical analyses are given for exploiting the parallelism of these operations. The related parallel methods supporting the solutions of these polynomial problems, such as the FFT, Toeplitz linear systems, and others, are also discussed. We present some experimental results of these parallel methods on the Intel hypercube. The parallel evaluation of polynomials based on Horner's rule is discussed in Section 2, along with experimental results on the Intel hypercube. The parallelism of polynomial multiplication is exploited by transforming the problem into a set of special FFT series functions, on which the operations can be perfectly distributed among different processors. Section 3 gives the mathematical analysis and the parallel method for polynomial multiplication. The polynomial division problem is solved based on parallel solutions for Toeplitz triangular linear systems and the parallel polynomial multiplication, and is discussed in Section 4. Section 5 addresses a parallel method for Lagrange piecewise cubic polynomial interpolation. Finally, we give a summary and future work in the last section.
{"title":"Parallel Methods for Solving Polynomial Problems on Distributed Memory Multicomputers","authors":"Xiaodong Zhang, Hao Lu","doi":"10.1109/DMCC.1990.555390","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555390","url":null,"abstract":"We give a group of parallel methods for solving polynomial related problems and their implementations on a distributed memory multicomputer. These problems are 1. the evaluation of polynomials, 2. the multiplication of polynomials, 3. the division of polynomials, and 4. the interpolation of polynomials. Mathematical analyses are given for exploiting the parallelisms of these operations. The related parallel methods supporting the solutions of these polynomial problems, such as FFT, Toeplitz linear systems and others are also discussed. We present some experimental results of these parallel methods on the Intel hypercube. polynomials based on the Horner’s rule is discussed in section 2. The experimental results on the Intel hypercube are also presented. The parallelism of the polynomial multiplication is exploited by transferring the problem to a set of special FFT series functions, on which the operations can be perfectly distributed among different processors. Section 3 gives the mathematical analyses and parallel method of the polynomial multiplication. The polynomial division problem is solved based on parallel solutions for Toeplitz triangular linear systems and the parallel polynomial multiplication, and is discussed in section 4. Section 5 addresses a parallel method for the Lagrange piecewise cubic polynomial interpolation. Finally, we give a summary and future work in the last section.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130821873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Design and Implementation of a Multi-Cache System on a Loosely Coupled Multiprocessor
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.556268
B. Rochat
The GOTHIC* distributed system implements a generalized virtual memory. The basic memory concept is the segment, characterized by three main features: persistence, direct access, and shared-memory consistency. Thus, the segment can be used for interprocess communication and permanent storage management. This paper explores the design and implementation of a generalized virtual memory and emphasizes data-sharing management in a loosely coupled multiprocessor memory hierarchy.
{"title":"Design and lmplementation of a Multi-Cache System on a Loosely Coupled Multiprocessor","authors":"B. Rochat","doi":"10.1109/DMCC.1990.556268","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556268","url":null,"abstract":"The GOTHIC* distributed system implements a generalized virtual memory. The basic memory concept is the segment characterized by three main features : persistence, direct access and shared memory consistency. Thus, the segment can be used for interprocess communication and permanent storage management. This paper explores the design and implementation of a generalized virtual memory and emphasizes data sharing management in loosely coupled multiprocessor memory hierarchy.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131074055","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Concurrent Implementation of a Fast Vortex Method
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.555420
F. Pépin, A. Leonard
Vortex methods are a powerful tool for the numerical simulation of incompressible flows at high Reynolds number. They are based on a discrete representation of the vorticity field; in the inviscid limit, the computational elements, or vortices, are simply advected at the local fluid velocity. The numerical approximations transform the vorticity equation, a non-linear PDE, into an N-body problem. The O(N^2) time complexity usually associated with these problems has limited the number of computational elements to a few thousand. This paper is concerned with the concurrent implementation of fast vortex methods that reduce the time complexity to O(N log N). The fast algorithm that is used combines a binary tree data structure with high-order expansions for the induced velocity field. The implementation of this particular algorithm on an MIMD architecture is discussed.
{"title":"Concurrent Implementation of a Fast Vortex Method","authors":"F. Pépin, A. Leonard","doi":"10.1109/DMCC.1990.555420","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555420","url":null,"abstract":"v2u = -V x (we,) . (3) Vortex methods are a powerfil tool for the numerical simulation of incompressible flows at high Reynolds number. They are based on a discrete representation of the vorticity field and in the inviscid limit, the computational elements, or vortices, are simply advected at the local fluid velocity. The numerical approximations transform the vorticity equation, a non-linear PDE, into a N-body problem. The S(N2) time complexity usually associated with these problems has limited the number of computational elements to a few thousands. This paper is concerned with the concurrent implementation of fast vortex methods that reduce the time complexity to U(N1ogN). The fast algorithm that is used combines a binary tree data structure with high order expansions for the induced velocity field. The implementation of this particular algorithm on an MIMD archilecture is discussed. Vortex Methods","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128718071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Parallel Thinning on a Distributed Memory Machine
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.555364
J. Baek, K. Teague
A parallel thinning algorithm based on boundary following is presented in this paper. The boundary of each object region is extracted and linked in parallel. The resulting object boundary data is divided based on the object size and the number of nodes for load balancing; then the divided objects are redistributed to the nodes. Each boundary in a node is projected on a "working plane". Next, the boundary data is repeatedly shrunken until only the skeleton of the region remains. The conventional iterative parallel algorithm as well as our new algorithm are implemented on a hypercube-topology multiprocessor computer, the Intel iPSC/2. The two algorithms are compared and analyzed. Some resulting figures and execution times are presented.
{"title":"Parallel Thinning on a Distributed Memory Machine","authors":"J. Baek, K. Teague","doi":"10.1109/DMCC.1990.555364","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555364","url":null,"abstract":"A p a r a l l e l th inning algor i thm based on boundary fo l lowing i s presented i n t h i s paper. The boundary o f each object region i s extracted and l inked i n pa ra l l e l . The resu l t ing object boundary data i s div ided based on the object s ize and the nurrber o f nodes f o r load balancing, then the divided objects are red is t r ibu ted t o the nodes. Each boundary i n a node i s projected on a Ilworking planell. Next, the boundary data i s repeatedly shrunken unti l only the skeleton o f the region remains. The conventional i t e r a t i v e pa ra l l e l algori thm as wel l as our new algor i thm are implemented on a hypercubetopology multiprocessor computer, the I n t e l iPSC/2. The two algorithms are compared and analyzed. Some resu l t ing f igures and execution times are presented.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126859794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An Efficient Method For Distributing Data In Hypercube Computers
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.556292
D. Lee, M. Aboelaze
{"title":"An Efficient Method For Distributing Data In Hypercube Computers","authors":"D. Lee, M. Aboelaze","doi":"10.1109/DMCC.1990.556292","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556292","url":null,"abstract":"","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123574199","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}