Scalability issues for a class of CFD applications
V. Naik
Pub Date: 1992-04-26 | DOI: 10.1109/SHPCC.1992.232632
Considers the performance scalability of a class of computational fluid dynamics applications. The results indicate that neither scalability in time nor scalability in problem size can be obtained by simply scaling up the processing power. Results are presented to show that latency, packet size, and transmission speed play an important role. However, improvements in the architectural parameters alone are not sufficient to realize full performance scalability; suitable partitioning and algorithmic parameters must be selected for each type of architecture.
HYPERGEN: a distributed genetic algorithm on a hypercube
L. Knight, R. L. Wainwright
Pub Date: 1992-04-26 | DOI: 10.1109/SHPCC.1992.232638
The genetic algorithm is a robust search and optimization technique based on the principles of natural genetics and survival of the fittest. Genetic algorithms (GAs) are a promising approach to global optimization and are applicable to a wide variety of problems. HYPERGEN was developed as a research tool for investigating parallel genetic algorithms applied to combinatorial optimization problems. It provides the user with a wide variety of options to test the particular problem at hand, and it is modular enough for users to insert their own routines for special needs or for further research on parallel GAs. HYPERGEN was used successfully to find new 'best' tours on three standard TSP instances, and it outperformed a parallel simulated annealing algorithm on various package placement problems. The authors found it fairly easy to fine-tune the parameters that drive a parallel GA (population size, migration rate, and migration interval) for near-optimal performance.
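The island-model scheme the abstract alludes to — independent subpopulations that exchange a few migrants at a fixed interval — can be sketched compactly. This is not HYPERGEN's code: the truncation selection, one-point crossover, mutation rate, ring migration topology, and the one-max fitness below are all illustrative assumptions.

```python
import random

def island_ga(fitness, genome_len, n_islands=4, pop_size=20,
              generations=60, migration_interval=10, migration_rate=2,
              seed=0):
    """Island-model GA: each island evolves independently; every
    `migration_interval` generations its best individuals replace the
    worst individuals of the next island in a ring."""
    rng = random.Random(seed)
    islands = [[[rng.randint(0, 1) for _ in range(genome_len)]
                for _ in range(pop_size)] for _ in range(n_islands)]

    def evolve(pop):
        pop.sort(key=fitness, reverse=True)
        survivors = pop[:pop_size // 2]          # truncation selection
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)
            cut = rng.randrange(1, genome_len)   # one-point crossover
            child = a[:cut] + b[cut:]
            if rng.random() < 0.1:               # occasional bit-flip mutation
                i = rng.randrange(genome_len)
                child[i] ^= 1
            children.append(child)
        return survivors + children

    for gen in range(1, generations + 1):
        islands = [evolve(pop) for pop in islands]
        if gen % migration_interval == 0:        # ring migration step
            for i, pop in enumerate(islands):
                dest = islands[(i + 1) % n_islands]
                best = sorted(pop, key=fitness, reverse=True)[:migration_rate]
                dest.sort(key=fitness)           # worst first
                dest[:migration_rate] = [g[:] for g in best]
    return max((g for pop in islands for g in pop), key=fitness)

# One-max toy problem: fitness is simply the number of 1 bits.
best = island_ga(sum, genome_len=16)
```

On a multicomputer each island would live on its own node and migration would be a small message to a neighbour, which is what makes the model attractive on a hypercube.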
A matrix product algorithm and its comparative performance on hypercubes
C. Lin, L. Snyder
Pub Date: 1992-04-26 | DOI: 10.1109/SHPCC.1992.232648
A matrix product algorithm is studied in which one matrix operand is transposed prior to the computation. This algorithm is compared with the Fox-Hey-Otto algorithm on hypercube architectures. The Transpose algorithm simplifies communication for nonsquare matrices and for computations where the number of processors is not a perfect square. The results indicate superior performance for the Transpose algorithm.
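The core idea — transpose one operand so that every element of the product becomes a dot product of two *rows* — can be shown in a serial sketch. The paper's contribution is the distributed hypercube version; this only illustrates the data layout that the up-front transposition buys.

```python
def transpose(mat):
    # Columns of `mat` become rows, so in row-major storage each future
    # dot-product operand is contiguous (one block/message per row on a
    # distributed machine, rather than a scattered column).
    return [list(col) for col in zip(*mat)]

def matmul_transpose(a, b):
    """C = A x B, computed after transposing B: C[i][j] is the dot
    product of row i of A with row j of B-transpose."""
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(ra, rb)) for rb in bt]
            for ra in a]

# Nonsquare example, the case the abstract highlights: (2x3) * (3x2).
a = [[1, 2, 3], [4, 5, 6]]
b = [[7, 8], [9, 10], [11, 12]]
c = matmul_transpose(a, b)
```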
Improving the performance of message-passing applications by multithreading
E. Felten, D. McNamee
Pub Date: 1992-04-26 | DOI: 10.1109/SHPCC.1992.232684
Achieving maximum performance in message-passing programs requires that calculation and communication be overlapped. However, the program transformations required to achieve this overlap are error-prone and add significant complexity to the application program. The authors argue that calculation/communication overlap can be achieved easily and consistently by executing multiple threads of control on each processor, and that this approach is practical on message-passing architectures without any special hardware support. They present timing data for a typical message-passing application to demonstrate the advantages of the scheme.
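The overlap the authors advocate can be mimicked with ordinary threads: a receiver thread drains incoming "messages" while the main thread computes on whatever has already arrived, so neither waits for the other to finish. The queue-based structure below is a generic sketch, not the paper's runtime system.

```python
import queue
import threading
import time

def fetch(n_msgs, inbox):
    """Stand-in for a message-receive loop running in its own thread."""
    for i in range(n_msgs):
        time.sleep(0.01)      # pretend network latency per message
        inbox.put(i)

def compute(x):
    return x * x              # stand-in for local computation

inbox = queue.Queue()
receiver = threading.Thread(target=fetch, args=(5, inbox))
receiver.start()

# Main thread computes as data arrives; blocking `get` hands control to
# the receiver thread, so communication overlaps the computation.
results = [compute(inbox.get()) for _ in range(5)]
receiver.join()
```

The application-level code stays a simple loop; the error-prone restructuring (splitting sends/receives and interleaving them with computation by hand) is what the threaded structure avoids.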
Image algebra: an object oriented approach to transparently concurrent image processing
I. Angus
Pub Date: 1992-04-26 | DOI: 10.1109/SHPCC.1992.232694
The image algebra formalism provides a succinct, high-level algebraic method of describing many image processing algorithms. By exploiting this formalism it is possible to map the mathematical algorithms, for which pixel-level parallelism is transparent, into C++ code that is portable across MIMD, SIMD, and sequential architectures. The advantage of this method is that complex image processing algorithms can be prototyped and tested on any machine and then migrated directly to parallel machines without burdening the user with issues such as parallelism and data decomposition.
The Multicomputer Toolbox approach to concurrent BLAS and LACS
R. Falgout, A. Skjellum, S.G. Smith, C. Still
Pub Date: 1992-04-26 | DOI: 10.1109/SHPCC.1992.232679
The authors describe many of the issues involved in general-purpose concurrent basic linear algebra subprograms (concurrent BLAS, or CBLAS) and discuss data-distribution independence, while further generalizing data distributions. They comment on the utility of linear algebra communication subprograms (LACS), describe an algorithm for dense matrix-matrix multiplication, and discuss matrix-vector multiplication issues. With regard to communication, they conclude that there is limited leverage in LACS as a stand-alone message-passing standard, and they propose that the needed capabilities instead be integrated into a general, application-level message-passing standard, focusing attention on CBLAS and large-scale application needs; most of the proposed LACS features resemble existing or needed general-purpose primitives in any case. All of the ideas discussed have been implemented, or are under development, within the Multicomputer Toolbox open software system.
Monte Carlo particle simulation of low-density fluid flow on MIMD supercomputers
S. Plimpton, T. Bartel
Pub Date: 1992-04-26 | DOI: 10.1109/SHPCC.1992.232643
Direct simulation Monte Carlo is a well-established technique for modeling low-density fluid flows. The parallel implementation of a general simulation which allows for body-fitted grids, particle weighting, and a variety of surface and flow chemistry models is described. The authors compare its performance on a 1024-node nCUBE 2 to a serial version for the Cray Y-MP. Experiences with load-balancing the computation via graph-based heuristics and newer spectral techniques are also discussed. This is a critical issue, since density fluctuations can create orders-of-magnitude differences in computational loads as the simulation progresses.
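Why load balancing matters here is easy to see with a toy partitioner. The sketch below greedily assigns contiguous runs of cells to processors based on per-cell work estimates; it is a deliberately simple stand-in for the graph-based and spectral balancers the paper actually uses, just to show how skewed particle densities force uneven cell counts per processor.

```python
def balance(costs, nproc):
    """Greedy contiguous partition of per-cell work estimates so that
    each processor receives roughly total/nproc work. A dense cell may
    claim a whole processor while sparse cells are grouped together."""
    target = sum(costs) / nproc
    parts, cur, acc = [], [], 0.0
    for c in costs:
        cur.append(c)
        acc += c
        if acc >= target and len(parts) < nproc - 1:
            parts.append(cur)          # this processor is "full"
            cur, acc = [], 0.0
    parts.append(cur)                  # remainder goes to the last one
    return parts

# Uniform density: every processor gets the same number of cells.
even = balance([1] * 8, 4)
# A density spike: one hot cell occupies a processor by itself.
skewed = balance([10, 1, 1, 1, 1, 1, 1, 1], 2)
```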
Programming an astrophysics application in an object-oriented parallel language
S. X. Yang, Jenq-Kuen Lee, S. Narayana, Dennis Gannon
Pub Date: 1992-04-26 | DOI: 10.1109/SHPCC.1992.232637
A three-dimensional hydrodynamics code is used to test a newly developed parallel C++ (pC++) language and compiler. The original code is written in FORTRAN77 and is designed to model self-gravitating compressible gas flows. The code is rewritten in pC++ and tested on a BBN GP1000 and an Alliant FX/2800. Nearly linear speed-up is achieved on both machines. On the Alliant, a comparison between the pC++ code and the original FORTRAN77 code is conducted: for 6 or more processors, the pC++ code outperforms the FORTRAN77 code that is automatically vectorized and parallelized by the Alliant FORTRAN compiler.
Parallel molecular dynamics on a torus network
K. Esselink, P. Hilbers
Pub Date: 1992-04-26 | DOI: 10.1109/SHPCC.1992.232681
The paper presents some theoretical results concerning molecular dynamics simulations on parallel networks. Specifically, it gives rules which, depending on the system to be simulated and on the processor network, yield the optimal mapping for a class of algorithms. It also shows that multi-particle potentials can be implemented efficiently when geometric parallelism is used. The paper demonstrates the approach with results of simulations of water/oil/surfactant and polymer systems on a toroidal network of transputers, and it compares timing results of some simulations performed on this network with those performed on a single-processor Cray machine.
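Geometric parallelism on a torus amounts to giving each process a box of the simulation space and four wraparound neighbours (in 2-D), so the rank arithmetic is the essential mechanism. A minimal sketch, where the P x Q grid shape and row-major rank ordering are assumptions rather than the paper's notation:

```python
def torus_neighbors(rank, P, Q):
    """Ranks of the four torus neighbours of `rank` on a P x Q process
    grid with wraparound, using row-major rank numbering. Each process
    exchanges boundary particles only with these neighbours."""
    r, c = divmod(rank, Q)
    return {
        "north": ((r - 1) % P) * Q + c,
        "south": ((r + 1) % P) * Q + c,
        "west":  r * Q + (c - 1) % Q,
        "east":  r * Q + (c + 1) % Q,
    }

# Centre of a 3x3 grid: all four neighbours are distinct, no wraparound.
centre = torus_neighbors(4, 3, 3)
# Corner of a 3x3 grid: north and west wrap around the torus.
corner = torus_neighbors(0, 3, 3)
```

Because communication is only ever with these fixed neighbours, the message volume per step scales with the surface of each process's box rather than the whole system, which is what makes the torus mapping attractive for short-range molecular dynamics.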
Visual-aural representations of performance for a scalable application program
J. M. Francioni, D. Rover
Pub Date: 1992-04-26 | DOI: 10.1109/SHPCC.1992.232662
Visual and aural portrayals of parallel program execution are used to gain insight into how a program is working. The combination of portrayals in a coordinated performance environment provides the user with multiple perspectives and stimuli to comprehend complex, multidimensional run-time information. An open question for either medium is how well it scales: how effectively can it be used to represent program performance on a large parallel computer system? This paper investigates using sound in conjunction with graphics to represent the performance of a scalable application program, the SLALOM benchmark, executed on the nCUBE 2 distributed-memory parallel computer. Custom auralization software is coupled with the PICL and ParaGraph tools. The techniques and results of visually and aurally monitoring program execution on increasing numbers of processors are presented.