Concurrent DASSL Applied to Dynamic Distillation Column Simulation
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.555439
A. Skjellum, M. Morari
The accurate, high-speed solution of systems of ordinary differential-algebraic equations (DAE’s) of low index is of great importance in chemical, electrical and other engineering disciplines. Petzold’s Fortran-based DASSL is the most widely used sequential code for solving DAE’s. We have devised and implemented a completely new C code, Concurrent DASSL, specifically for multicomputers and patterned on DASSL. In this work, we address the issues of data distribution and the performance of the overall algorithm, rather than just that of individual steps. Concurrent DASSL is designed as an open, application-independent environment below which linear algebra algorithms may be added in addition to standard support for dense and sparse algorithms. The user may furthermore attach explicit data interconversions between the main computational steps, or choose compromise distributions. A “problem formulator” (simulation layer) must be constructed above Concurrent DASSL, for any specific problem domain. We indicate performance for a particular chemical engineering application, a sequence of coupled distillation columns. Future efforts are cited in conclusion.
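Like DASSL, a DASSL-style solver integrates fully implicit systems F(t, y, y') = 0 by driving a user-supplied residual function to zero with backward-differentiation formulas. Below is a minimal sketch of that residual interface in C, Concurrent DASSL's implementation language; the signature and names are illustrative, not Concurrent DASSL's actual API.

```c
/* Sketch of the residual interface a DASSL-style solver expects: the
 * user supplies F(t, y, y') and the integrator drives it to zero with
 * BDF formulas.  Names are illustrative, not the actual API.
 *
 * Example index-1 DAE:  y0' = -y0,   0 = y0 + y1               */
#include <stddef.h>

void residual(double t, const double *y, const double *yp,
              double *delta, size_t n)
{
    (void)t; (void)n;               /* unused in this toy system */
    delta[0] = yp[0] + y[0];        /* differential equation     */
    delta[1] = y[0] + y[1];         /* algebraic constraint      */
}
```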
{"title":"Concurrent DASSL Applied to Dynamic Distillation Column Simulation","authors":"A. Skjellum, M. Morari","doi":"10.1109/DMCC.1990.555439","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555439","url":null,"abstract":"The accurate, high-speed solution of systems of ordinary differential-algebraic equations (DAE’s) of low index is of great importance in chemical, electrical and other engineering disciplines. Petzold’s Fortran-based DASSL is the most widely used sequential code for solving DAE’s. We have devised and implemented a completely new C code, Concurrent DASSL, specifically for multicomputers and patterned on DASSL. In this work, we address the issues of data distribution and the performance of the overall algorithm, rather than just that of individual steps. Concurrent DASSL is designed as an open, application-independent environment below which linear algebra algorithms may be added in addition to standard support for dense and sparse algorithms. The user may furthermore attach explicit data interconversions between the main computational steps, or choose compromise distributions. A “problem formulator” (simulation layer) must be constructed above Concurrent DASSL, for any specific problem domain. We indicate performance for a particular chemical engineering application, a sequence of coupled distillation columns. Future efforts are cited in conclusion.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128019193","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An Orthogonal Multiprocessor With Snooping Caches
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.556275
P. Raja, S. Ganesan
This paper discusses the design and implementation of a multiprocessor for scientific calculations using snooping caches. The architecture of the orthogonal multiprocessor, the cache coherency problem, the cache coherency protocol, performance analysis, response time equations, and the cache memory architecture are discussed.
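The abstract names a cache coherency protocol without detailing it. As a point of reference only, here is a minimal state-transition sketch of a generic write-invalidate snooping protocol with MSI states; the paper's actual protocol may well differ.

```c
/* Generic write-invalidate snooping sketch (MSI states); illustrative
 * only -- not necessarily the protocol used in the paper. */
typedef enum { INVALID, SHARED, MODIFIED } line_state;

/* Transition when a snooping cache observes a remote write on the bus. */
line_state snoop_bus_write(line_state s)
{
    (void)s;
    return INVALID;   /* any remote write invalidates the local copy */
}

/* Transition on a local write (an invalidate is broadcast first when
 * the line was INVALID or SHARED). */
line_state local_write(line_state s)
{
    (void)s;
    return MODIFIED;
}
```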
{"title":"An Orthogonal Multiprocessor With Snooping Caches","authors":"P. Raja, S. Ganesan","doi":"10.1109/DMCC.1990.556275","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556275","url":null,"abstract":"This paper discusses the design and implementation of a multiprocessor for scientific calculations using snooping caches. Architecture of the orthogonal multiprocessor, cache coherency problem, cache coherency protocol, performance analysis, response time equations, cache memory architecture, etc., are discussed.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"82 7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128165177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A Scheme for Supporting Automatic Data Migration on Multicomputers
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.556314
S. Mirchandaney, J. Saltz, P. Mehrotra, H. Berryman
Abstract: A data migration mechanism is proposed that allows an explicit and controlled mapping of data to memory. While read or write copies of each data element can be assigned to any processor's memory, longer-term storage of each data element is assigned to a specific location in the memory of a particular processor. Data is presented suggesting that the scheme may be a practical method for efficiently supporting data migration. Keywords: Distributed machines, Data migration, Caching.
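The scheme's key distinction is between a fixed long-term home for each element and transient copies that may live anywhere. The following is a hedged sketch of one way to encode that mapping; the structure and the block home assignment are illustrative, not the authors' data structures.

```c
/* Hedged sketch of the home/copy distinction described above: each
 * element has one fixed home processor holding long-term storage,
 * while a transient read or write copy may live anywhere. */
typedef struct {
    int home;          /* processor holding long-term storage       */
    int copy_holder;   /* processor holding a current copy, or -1   */
    int dirty;         /* nonzero if the copy must be written home  */
} element_map;

/* One plausible home assignment: a simple block mapping of n
 * elements over p processors. */
int home_of(int i, int n, int p)
{
    int block = (n + p - 1) / p;   /* ceil(n / p) elements per node */
    return i / block;
}
```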
{"title":"A Scheme for Supporting Automatic Data Migration on Multlcomputers","authors":"S. Mirchandaney, J. Saltz, P. Mehrotra, H. Berryman","doi":"10.1109/DMCC.1990.556314","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556314","url":null,"abstract":"Abstract : A data migration mechanism is proposed that allows an explicit and controlled mapping of data to memory. While read or write or write copies of each data element can be assigned to any processor's memory, longer term storage of each data element is assigned to a specific location in the memory of a particular processor. Data is presented that suggests that the scheme may be a practical method for efficiently supporting data migration. Keywords: Distributed machines, Data migration, Cacheing.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130194636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A Hypercube Application in Large Scale Composite Materials Modeling
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.555425
C. Baldwin, S. Durham, J. D. Lynch, W. J. Padgett
This large-scale application combines several areas of research to develop computational models for simulating the failure mechanisms of composite materials consisting of brittle fibers (such as carbon) embedded in a matrix material (such as epoxy resin). The simulations combine structural stress analysis, numerical linear algebra, and visualization techniques to model the behavior of fibrous composites under uniaxial tensile load. This allows laboratory experiments to be extrapolated more accurately to real applications, providing an enhanced capability to optimize designs of large structures made of composite materials with less extensive and costly experimental programs. Further, system performance and reliability may be improved substantially. The paper first gives a brief discussion of the theory of composite materials as it relates to the simulations. Next, the procedures used to generate and analyze the structure are presented, followed by the computational techniques used to perform the simulation and results from selected test cases. The paper closes with a summary of results and future directions for this research.
{"title":"A Hypercube Application in Large Scale Composite Materials Modeling","authors":"C. Baldwin, S. Durham, J. D. Lynch, W. J. Padgett","doi":"10.1109/DMCC.1990.555425","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555425","url":null,"abstract":"This large scale application combines several areas of research to develop computational models for simulating the failure mechanisms of composite materials consisting of brittle fibers (such as carbon) embedded in a matrix material (such as epoxy resin). The simulations combine the ideas of structural stress analysis, numerical linear algebra, and visualization techniques to model the behavior of fibrous composites under uniaxial tensile load. This will allow laboratory experiments to be extrapolated more accurately to real applications, providing an enhanced capability to optimize designs of large structures made of composite materials with less extensive and costly experimental programs. Further, system performance and reliability may be improved substantially. In this paper a brief discussion of the theory of composite materials as it relates to the simulations will first be given. Next the procedures used to generate and analyze the structure will be presented. The computational techniques used to perform the simulation will be given as well as results from selected test cases. A summary of results and future directions in this research will be given at the end of the paper.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129395267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Parallel Nonlinear Optimization
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.555394
Ron Daniel
This paper describes the implementation of a parallel Levenberg-Marquardt algorithm on an iPSC/2. The Levenberg-Marquardt algorithm is a standard technique for non-linear least-squares optimization. For a problem with D data points and P parameters to be estimated, each iteration requires that the objective function and its P partials be evaluated at all D data points, using the current parameter estimates. Each iteration also requires the solution of a PxP linear system to obtain the next set of parameter estimates. A simple data-parallel decomposition is used where the data is evenly distributed across the nodes to parallelize the evaluations of the objective function and its partial derivatives. The performance of the method is characterized versus the number of nodes, the number of data points, and the number of parameters in the objective function. Further enhancements are also discussed.
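For reference, the PxP system solved at each iteration is the damped normal-equations step of the standard Levenberg-Marquardt method, shown below in textbook form; the paper does not state its exact scaling of the damping term.

```latex
% Standard Levenberg-Marquardt step: r is the D-vector of residuals at
% the current estimate beta_k, J is its D x P Jacobian, and lambda is
% the damping parameter.  Marquardt's variant replaces I with
% diag(J^T J).
\[
  (J^{\top} J + \lambda I)\,\delta = -J^{\top} r,
  \qquad \beta_{k+1} = \beta_k + \delta .
\]
```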
{"title":"Parallel Nonlinear Optimization","authors":"Ron Daniel","doi":"10.1109/DMCC.1990.555394","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555394","url":null,"abstract":"This paper describes the implementation of a parallel Levenberg-Marquardt algorithm on an iPSC/2. The Levenberg-Marquardt algorithm is a standard technique for non-linear least-squares optimization. For a problem with D data points and P parameters to be estimated, each iteration requires that the objective function and its P partials be evaluated at all D data points, using the current parameter estimates. Each iteration also requires the solution of a PxP linear system to obtain the next set of parameter estimates. A simple data-parallel decomposition is used where the data is evenly distributed across the nodes to parallelize the evaluations of the objective function and its partial derivatives. The performance of the method is characterized versus the number of nodes, the number of data points, and the number of parameters in the objective function. Further enhancements are also discussed.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"44 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132911550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Applications of Adaptive Data Distributions
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.555391
E. F. Van de Velde, J. Lorenz
Continuation methods compute paths of solutions of nonlinear equations that depend on a parameter. This paper examines some aspects of the multicomputer implementation of such methods. The computation is done on the Symult Series 2010 multicomputer. One of the main issues in the development of concurrent programs is load balancing, achieved here by using appropriate data distributions. In the continuation process, a large number of linear systems have to be solved. For nearby points along the solution path, the corresponding system matrices are closely related to each other. Therefore, pivots which are good for the LU-decomposition of one matrix are likely to be acceptable for a whole segment of the solution path. This suggests choosing certain data distributions that achieve good load balancing. In addition, if these distributions are used, the resulting code is easily vectorized. To test this technique, the invariant manifold of a system of two identical nonlinear oscillators is computed as a function of the coupling between them. This invariant manifold is determined by the solution of a system of nonlinear partial differential equations that depends on the coupling parameter. A symmetry in the problem reduces this system to one single equation, which is discretized by finite differences. The solution of this discrete nonlinear system is followed as the coupling parameter is changed.
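In outline, a continuation step for a parameter-dependent system F(u, lambda) = 0 has the standard predictor-corrector form below; this is a generic formulation, and the paper's method may differ in detail.

```latex
% Predictor: step along the path tangent, obtained by solving one
% linear system with the Jacobian F_u.
\[
  F_u(u_k,\lambda_k)\,\dot u_k = -F_\lambda(u_k,\lambda_k),
  \qquad u^{(0)} = u_k + \Delta\lambda\,\dot u_k .
\]
% Corrector: Newton iteration at the new parameter value.
\[
  F_u\big(u^{(j)},\lambda_{k+1}\big)\,\delta^{(j)}
     = -F\big(u^{(j)},\lambda_{k+1}\big),
  \qquad u^{(j+1)} = u^{(j)} + \delta^{(j)} .
\]
% Each corrector step solves a linear system with F_u; along the path
% these matrices change slowly, which is why LU pivots can be reused
% over whole path segments, as the abstract argues.
```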
{"title":"Applications of Adaptive Data Distributions","authors":"E. F. Van de Velde, J. Lorenz","doi":"10.1109/DMCC.1990.555391","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555391","url":null,"abstract":"Continuation methods compute paths of solutions of nonlinear equations that depend on a parameter. This paper examines some aspects of the multicomputer implementation of such methods. The computation is done on the Symult Series 2010 multicomputer. One of the main issues in the development of concurrent programs is load balancing, achieved here by using appropriate data distributions. In the continuation process, a large number of linear systems have to be solved. For nearby points along the solution path, the corresponding system matrices are closely related to each other. Therefore, pivots which are good for the LU-decomposition of one matrix are likely to be acceptable for a whole segment of the solution path. This suggests to choose certain data distributions that achieve good load balancing. In addition, if these distributions are used, the resulting code is easily vectorized. To test this technique, the invariant manifold of a system of two identical nonlinear oscillators is computed as a function of the coupling between them. This invariant manifold is determined by the solution of a system of nonlinear partial differential equations that depends on the coupling parameter. A symmetry in the problem reduces this system to one single equation, which is discretized by finite differences. The solution of this discrete nonlinear system is followed as the coupling parameter is changed.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132948075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Conjugate Gradient Methods for Spline Collocation Equations
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.555433
C. Christara
We study the parallel computation of linear second order elliptic Partial Differential Equation (PDE) problems in rectangular domains. We discuss the application of Conjugate Gradient (CG) and Preconditioned Conjugate Gradient (PCG) methods to the linear system arising from the discretisation of such problems using quadratic splines and the collocation discretisation methodology. Our experiments show that the number of iterations required for convergence of CG-QSC (Conjugate Gradient applied to Quadratic Spline Collocation equations) grows linearly with the square root of the number of equations. We implemented the CG and PCG methods for the solution of the Quadratic Spline Collocation (QSC) equations on the iPSC/2 hypercube and present performance evaluation results for configurations of up to 32 processors. Our experiments show efficiencies on the order of 90% for both fixed and scaled speedups.
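For reference, the iteration being parallelized is the standard conjugate-gradient kernel, sketched serially below for a dense symmetric positive-definite system. This is a minimal sketch only; the distributed QSC solver additionally partitions the matrix-vector product and inner products across the hypercube nodes.

```c
/* Minimal serial CG kernel for a dense SPD n x n system Ax = b.
 * r, p, Ap are caller-provided length-n workspaces. */
#include <math.h>
#include <string.h>

static double dot(const double *x, const double *y, int n)
{
    double s = 0.0;
    for (int i = 0; i < n; i++) s += x[i] * y[i];
    return s;
}

void cg(const double *A, const double *b, double *x, int n,
        double tol, int maxit, double *r, double *p, double *Ap)
{
    memset(x, 0, n * sizeof *x);          /* x0 = 0, so r = b        */
    memcpy(r, b, n * sizeof *r);
    memcpy(p, r, n * sizeof *p);
    double rho = dot(r, r, n);
    for (int k = 0; k < maxit && sqrt(rho) > tol; k++) {
        for (int i = 0; i < n; i++) {     /* Ap = A * p              */
            double s = 0.0;
            for (int j = 0; j < n; j++) s += A[i * n + j] * p[j];
            Ap[i] = s;
        }
        double alpha = rho / dot(p, Ap, n);
        for (int i = 0; i < n; i++) {     /* update iterate, residual */
            x[i] += alpha * p[i];
            r[i] -= alpha * Ap[i];
        }
        double rho_new = dot(r, r, n);
        double beta = rho_new / rho;
        for (int i = 0; i < n; i++)       /* new search direction    */
            p[i] = r[i] + beta * p[i];
        rho = rho_new;
    }
}
```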
{"title":"Conjugate Gradient Methods for Spline Collocation Equations","authors":"C. Christara","doi":"10.1109/DMCC.1990.555433","DOIUrl":"https://doi.org/10.1109/DMCC.1990.555433","url":null,"abstract":"We study the parallel computation of linear second order elliptic Partial Differential Equation (PDE) problems in rectangular domains. We discuss the application of Conjugate Gradient (CG) and Preconditioned Conjugate Gradient (PCG) methods to the linear system arising from the discretisation of such problems using quadratic splines and the collocation discretisation methodology. Our experiments show that the number of iterations required for convergence of CG-QSC (Conjugate Gradient applied to Quadratic Spline Collocation equations) grows linearly with the square root of the number of equations. We implemented the CG and PCG methods for the solution of the Quadratic Spline Collocation (QSC) equations on the iPSC/2 hypercube and present performance evaluation results for up to 32 processors configurations. Our experiments show efficiencies of the order of 90%, for both the fixed and scaled speedups.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133763993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Synchronized Blocking in a Distributed Memory System
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.556390
C. G. Rommel
{"title":"Synchronized Blocking in a Distributed Memory System","authors":"C. G. Rommel","doi":"10.1109/DMCC.1990.556390","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556390","url":null,"abstract":"","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115432524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
An SIMD Multiprocessor Using DSP Microprocessors
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.556273
S. Ganesan, P. Raja
This paper discusses a parallel processor design using DSP microprocessors and dual-port RAMs (DPRs) for image processing applications and scientific computations. The parallel processor uses eight TMS320C25 Digital Signal Processors (DSPs) and dual-port RAMs. The application of matrix multiplication and image processing algorithms to this architecture is discussed.
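A natural way to spread a matrix multiply C = A * B over eight processors is to partition the rows among them. The sketch below shows that decomposition; the interleaved mapping and names are illustrative, and the paper's actual assignment of work to the DSPs may differ.

```c
/* Row-partitioned matrix multiply sketch: node `id` of `nprocs`
 * computes the interleaved rows it owns (illustrative mapping only). */
void matmul_rows(const double *A, const double *B, double *C,
                 int n, int id, int nprocs)
{
    for (int i = id; i < n; i += nprocs)      /* this node's rows */
        for (int j = 0; j < n; j++) {
            double s = 0.0;
            for (int k = 0; k < n; k++)
                s += A[i * n + k] * B[k * n + j];
            C[i * n + j] = s;
        }
}
```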
{"title":"An SIMD Multiprocessor Using DSP Microprocessors","authors":"S. Ganesan, P. Raja","doi":"10.1109/DMCC.1990.556273","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556273","url":null,"abstract":"In this paper, we have discussed a parallel processor design using DSP microprocessors and dual-port RAMs(DPRs) for image processing applications and scientific computations. This parallel processor uses eight TMS320C25 Digital Signal Processors (DSPs) and dual-port RAMs. Application of matrix multiplication algorithms and Image processing algorithms to this architecture are discussed.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117075577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Graceful Degradation on Hypercube Multiprocessors Using Data Redistribution
Pub Date: 1990-04-08  DOI: 10.1109/DMCC.1990.556408
C. Li, W. Fuchs
A data redistribution approach to graceful degradation is described in this paper for hypercube multiprocessors. CPU-bound hypercube programs using the described second-order parametrized data distribution technique can run on a group of cubes of any size to achieve graceful degradation without recompilation. A transmission mechanism has been designed to switch the performance of a second-order parametrized data distribution hypercube program to that of a corresponding first-order program when the latter is superior. A package of procedures has been implemented on the Intel iPSC/2 hypercube to support the approach.
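To illustrate the flavor of a parametrized data distribution, the sketch below gives a generic block-cyclic owner function: re-binding the processor-count parameter re-maps the data onto a smaller cube without recompiling. This is only an analogy; the paper's "second-order" distributions are defined differently.

```c
/* Generic parametrized (block-cyclic) owner function: element i maps
 * to a processor purely as a function of the distribution parameters
 * (block size b, processor count p).  Shrinking p redistributes the
 * data to a smaller cube with no recompilation -- illustrative of the
 * approach, not the paper's actual second-order distributions. */
int owner(int i, int b, int p)
{
    return (i / b) % p;
}
```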
{"title":"Graceful Degradation on Hypercube Multiprocessors Using Data Redistribution","authors":"C. Li, W. Fuchs","doi":"10.1109/DMCC.1990.556408","DOIUrl":"https://doi.org/10.1109/DMCC.1990.556408","url":null,"abstract":"A data redistribution approach to graceful degradation is described in this paper for hypercube multiprocessors. CPU-bound hypercube programs using the described second-order parametrized data distribution technique can run on a group of cubes of any size to achieve graceful degradation without recompila tion. A transmission mechanism has been designed to switch the performance of a second-order parametrized data distribution hypercube program to that of a corresponding first-order program when the latter is superior. A package of procedures has been implemented on the Intel iPSC/2 hypercube to support the approach.","PeriodicalId":204431,"journal":{"name":"Proceedings of the Fifth Distributed Memory Computing Conference, 1990.","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1990-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116036329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}