Scalable Performance Environments for Parallel Systems
Pub Date: 1991-10-18 | DOI: 10.1109/DMCC.1991.633315
Daniel A. Reed, R. D. Olson, R. Aydt, Tara M. Madhyastha, T. Birkett, David W. Jensen, B. Nazief, B. K. Totty
As parallel systems expand in size and complexity, the absence of performance tools for these parallel systems exacerbates the already difficult problems of application program and system software performance tuning. Moreover, given the pace of technological change, we can no longer afford to develop ad hoc, one-of-a-kind performance instrumentation software; we need scalable, portable performance analysis tools. We describe an environment prototype based on the lessons learned from two previous generations of performance data analysis software. Our environment prototype contains a set of performance data transformation modules that can be interconnected in user-specified ways. It is the responsibility of the environment infrastructure to hide details of module interconnection and data sharing. The environment is written in C++ with the graphical displays based on the X Window System and the Motif toolkit. It allows users to interconnect and configure modules graphically to form a directed, acyclic data analysis graph. Performance trace data are represented in a self-documenting stream format that includes internal definitions of data types, sizes, and names. The environment prototype supports the use of head-mounted displays and sonic data presentation in addition to the traditional use of visual techniques.
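A minimal sketch of what such a self-documenting stream might look like, with record descriptors carried in-band so any consumer can decode the records that follow; the record and field names below are invented for illustration and are not the paper's actual format:

```cpp
// Hypothetical sketch of a self-documenting trace stream: record
// descriptors travel in-band, so a consumer can decode record types it
// has never seen. Field and record names are invented for illustration.
#include <cstdint>
#include <iostream>
#include <string>
#include <vector>

struct FieldDescriptor {
    std::string name;   // e.g. "timestamp"
    std::string type;   // e.g. "uint64"
    uint32_t    size;   // size in bytes
};

struct RecordDescriptor {
    uint32_t tag;                         // identifies records of this type
    std::string name;                     // e.g. "MessageSend"
    std::vector<FieldDescriptor> fields;  // self-describing layout
};

int main() {
    RecordDescriptor send{1, "MessageSend",
                          {{"timestamp", "uint64", 8},
                           {"source",    "uint32", 4},
                           {"dest",      "uint32", 4},
                           {"bytes",     "uint32", 4}}};
    // A generic consumer needs only the descriptor to interpret the stream.
    for (const auto& f : send.fields)
        std::cout << send.name << "." << f.name << " : " << f.type
                  << " (" << f.size << " bytes)\n";
}
```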
{"title":"Scalable Performance Environments for Parallel Systems","authors":"Daniel A. Reed, R. D. Olson, R. Aydt, Tara M. Madhyastha, T. Birkett, David W. Jensen, B. Nazief, B. K. Totty","doi":"10.1109/DMCC.1991.633315","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633315","url":null,"abstract":"As parallel systems expand in size and complexity, the absence of performance tools for these parallel systems exacerbates the already difficult problems of application program and system software performance tuning. Moreover, given the pace of technological change, we can no longer afford to develop ad hoc, one-of-a-kind performance instrumentation software; we need scalable, portable performance analysis tools. We describe an environment prototype based on the lessons learned from two previous generations of performance data analysis software. Our environment prototype contains a set of performance data transformation modules that can be interconnected in user-specified ways. It is the responsibility of the environment infrastructure to hide details of module interconnection and data sharing. The environment is written in C++ with the graphical displays based on X windows and the Motif toolkit. It allows users to interconnect and configure modules graphically to form an acyclic, directed data analysis graph. Performance trace data are represented in a self-documenting stream format that includes internal definitions of data types, sizes, and names. The environment prototype supports the use of head-mounted displays and sonic data presentation in addition to the traditional use of visual techniques.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122370339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Design and Evaluation of Communication Processors supporting Message Passing in Distributed Memory Systems
Pub Date: 1991-04-28 | DOI: 10.1109/DMCC.1991.633352
H. Corporaal, J. Olk
The number of design decisions for connecting processor nodes within MIMD systems is rather large. This paper systematically introduces the most important design parameters for communication processors in MIMD systems. Together, these parameters span a multidimensional design space. Points in this space are clarified through classification of a number of existing communication processors. The design choices made for these processors are reviewed and their performance is evaluated. Suitable choices of the design parameters are highly influenced by application behavior. Ideally, one would like to design processors which cover a whole area in this design space. A companion paper describes a scalable and flexible design currently being realized at our laboratory.
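To make the design-space idea concrete, a point in the space can be pictured as one choice per dimension; the dimensions named here are common ones from the literature, not necessarily the paper's taxonomy:

```cpp
// Illustrative only: a point in a communication-processor design space,
// expressed as one choice per dimension. The dimensions named here are
// common in the literature, not necessarily the paper's taxonomy.
enum class Switching { StoreAndForward, VirtualCutThrough, Wormhole };
enum class Routing   { Deterministic, Adaptive };
enum class Buffering { InputQueued, OutputQueued, CentralPool };

struct DesignPoint {
    Switching switching;
    Routing   routing;
    Buffering buffering;
    int       channelsPerNode;   // node degree available for interconnection
    int       channelWidthBits;  // physical width of each link
};
```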
{"title":"Design and Evaluation of Communication Processors supporting Message Passing in Distributed Memory Systems","authors":"H. Corporaal, J. Olk","doi":"10.1109/DMCC.1991.633352","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633352","url":null,"abstract":"The number of design decisions for connecting processor nodes within MIMD systems is rather large. This paper systematically introduces the most important design parameters for communication processors in MIMD systems. Together, these parameters span a multidimensional design space. Points in this space are clarijied through classijication of a number of existing communication processors. The design choices made for these processors are reviewed and their performance is evaluated. Suitable choices of the design parameters are highly influenced by application behavior. Ideally one would like to design processors which cover a whole area in this design space. A companion paper describes a scalable andflexible design currently being realized at our laboratory.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"120 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115183275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Temporal Communication Graphs: A New Graph Theoretic Model for Mapping and Scheduling in Distributed Memory Systems
Pub Date: 1991-04-28 | DOI: 10.1109/DMCC.1991.633137
V. Lo
The temporal communication graph (TCG) is a new graph-theoretic model of parallel computation that we have developed for the mapping of parallel computations to message-passing parallel architectures. The TCG integrates the two dominant models currently in use in the areas of mapping, task assignment, partitioning, and scheduling: the static task graph and the DAG. The TCG augments these models with the capability to identify logically synchronous phases of communication and computation, and to describe the temporal behavior of a parallel algorithm in terms of these phases. This paper defines the TCG, illustrates its use for mapping and scheduling, and discusses a wide range of potential uses for the TCG in parallel programming environments.
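One plausible way to encode a TCG, assuming vertices are (process, phase) pairs with message edges within a phase and temporal edges linking successive phases; this encoding is our illustration, not the paper's formal definition:

```cpp
// Our illustration of a TCG encoding, not the paper's formal definition:
// vertices are (process, phase) pairs; message edges connect vertices in
// the same phase, temporal edges link successive phases of one process.
#include <vector>

struct TcgVertex {
    int process;  // logical process identifier
    int phase;    // synchronous phase of the computation
};

struct TcgEdge {
    int  from, to;  // indices into the vertex array
    bool temporal;  // true: phase-ordering edge; false: message edge
};

struct Tcg {
    std::vector<TcgVertex> vertices;
    std::vector<TcgEdge>   edges;
};
```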
{"title":"Temporal Communication Graphs: A New Graph Theoretic Model Mapping and Scheduling in Distributed Memory Systems","authors":"V. Lo","doi":"10.1109/DMCC.1991.633137","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633137","url":null,"abstract":"The temporal communication graph is a new graph theoretic model of parallel computation that we have developed for the mapping of parallel computations to message-passing parallel architectures. The TCG integrates the two dominant models currently in use in the area of mapping, task assignment, partitioning, and scheduling: the static task graph and the DAG. The TCG augments these models with the capability to identify logically synchronous phases of communication and computation, and to describe the temporal behavior of a parallel algorithm in terms of these phases. This paper defines the TCG, illustrates its use for mapping and scheduling, and discusses a wide range of potentials uses for the TCG in the area of parallel programming environments.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122756296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Apply: A Parallel Compiler on iWarp for Image-Processing Applications
Pub Date: 1991-04-28 | DOI: 10.1109/DMCC.1991.633119
B. Baxter, B. Greer
Local operator computations used in 2-dimensional image processing can be applied to individual pixels independently, making it easy to perform this class of problems in parallel. The Apply language is designed to exploit this opportunity while hiding most parallel programming details from the programmer and retaining the look and feel of a conventional sequential style.
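The flavor of this per-pixel style can be sketched in plain C++ (Apply has its own syntax; this stand-in only shows the idea that the programmer writes one local operator and the compiler parallelizes the surrounding loops):

```cpp
// Plain C++ stand-in for the per-pixel style Apply supports (Apply has
// its own syntax): each output pixel depends only on a fixed input
// neighborhood, so the loops over pixels can be run in parallel.
#include <vector>

using Image = std::vector<std::vector<float>>;

// 3x3 smoothing operator for a single interior pixel (x, y).
float smooth3x3(const Image& in, int x, int y) {
    float sum = 0.0f;
    for (int dy = -1; dy <= 1; ++dy)
        for (int dx = -1; dx <= 1; ++dx)
            sum += in[y + dy][x + dx];
    return sum / 9.0f;
}

// The outer loops can be partitioned across processors without changing
// the operator itself.
void apply(const Image& in, Image& out) {
    for (int y = 1; y + 1 < (int)in.size(); ++y)
        for (int x = 1; x + 1 < (int)in[y].size(); ++x)
            out[y][x] = smooth3x3(in, x, y);
}
```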
{"title":"Apply: A Parallel Compiler on iWarp for Image-Processing Applications","authors":"B. Baxter, B. Greer","doi":"10.1109/DMCC.1991.633119","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633119","url":null,"abstract":"Local operator compurafions used in 2-dimensional image processing can be applied TO individual pixels independenr ly. making ir easy ro perform This class ofproblems in paral lel. The Apply language is designed to exploir rhis OppOrlU niry while hiding mosr parallel programming dewilsfrom the programmer andrewining rhe lookandfeel ofa conven fional sequential style.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121214046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Using Spanning-Trees for Balancing Dynamic Load on Multiprocessors
Pub Date: 1991-04-28 | DOI: 10.1109/DMCC.1991.633134
R. Melhem, K. Pruhs, T. Znati
We consider the problem of load balancing to minimize the cost of dynamic computations, including the cost of migrations. We analyze the costs associated with diffusion-based algorithms for several common architectures. We introduce the Ripple load-balancing paradigm, which has several advantages over diffusion methods, including flexibility and faster convergence.
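For contrast, a minimal sketch of the diffusion baseline the paper improves on: each node shifts a fraction alpha of every pairwise load imbalance across its edges per step (Ripple itself is the paper's contribution and is not reproduced here):

```cpp
// Baseline diffusion step (the Ripple paradigm itself is the paper's
// contribution and is not reproduced here): each node moves a fraction
// alpha of every pairwise load imbalance across the corresponding edge.
#include <cstddef>
#include <vector>

void diffusionStep(std::vector<double>& load,
                   const std::vector<std::vector<int>>& neighbors,
                   double alpha) {
    std::vector<double> next = load;
    for (std::size_t i = 0; i < load.size(); ++i)
        for (int j : neighbors[i])
            next[i] += alpha * (load[j] - load[i]);  // flow along edge (i, j)
    load = next;
}
```

For convergence, alpha must be small relative to the node degree (a standard safe choice is alpha <= 1/(1 + maximum degree)); this conservative step size is one reason diffusion can converge slowly.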
{"title":"Using Spanning-Trees for Balancing Dynamic Load on Multiprocessors","authors":"R. Melhem, K. Pruhs, T. Znati","doi":"10.1109/DMCC.1991.633134","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633134","url":null,"abstract":"We consider the problem of load balancing to minimize the cost of dynamic computations, including the cost of migrations. We analyze the costs associated with diffusion based algorithms for several common architectures. We introduce the Ripple load balancing paradigm, which has several advantages over diffusion methods, including flexibility and faster convergence.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114073030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Executing Synchronous Data Flow Graphs on Multicomputers
Pub Date: 1991-04-28 | DOI: 10.1109/DMCC.1991.633128
Zhiwei Xu
This paper presents a technique for programming distributed memory multicomputers by automatically generating parallel programs from parallel computations specified as synchronous data flow graphs or recurrence equations.
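A minimal sketch of the input such a generator might consume, assuming the standard synchronous data flow formalism in which each actor produces and consumes fixed token counts per firing, so a compile-time repetition vector balances every edge; the names are ours:

```cpp
// Assumed standard SDF formalism, with invented names: each actor produces
// and consumes a fixed number of tokens per firing, so a repetition vector
// r with produce * r[src] == consume * r[dst] on every edge exists and a
// static schedule can be generated at compile time.
#include <vector>

struct SdfEdge {
    int src, dst;  // actor indices
    int produce;   // tokens written per firing of src
    int consume;   // tokens read per firing of dst
};

struct SdfGraph {
    int numActors;
    std::vector<SdfEdge> edges;
};

// True if r balances every edge of g.
bool balanced(const SdfGraph& g, const std::vector<int>& r) {
    for (const SdfEdge& e : g.edges)
        if (e.produce * r[e.src] != e.consume * r[e.dst]) return false;
    return true;
}
```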
{"title":"Executing Synchronous Data Flow Graphs on Multicomputers","authors":"Zhiwei Xu","doi":"10.1109/DMCC.1991.633128","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633128","url":null,"abstract":"This paper presents a technique for programming distributed memory multicomputers by automatically generating parallel programs from parallel computations specified as synchronous data flow graphs or recurrence equations.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126469715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Reduction operations on a distributed memory machine with a reconfigurable interconnection network
Pub Date: 1991-04-28 | DOI: 10.1109/DMCC.1991.633354
S. Miguet, Yves Robert
Performing reduction operations on distributed memory machines whose interconnection networks are reconfigurable is considered. The focus is on machines whose interconnection graph can be configured as any graph of maximum degree d. The best way of interconnecting the p processors, as a function of p, d, and some problem- and machine-dependent parameters that characterize the communication-to-arithmetic ratio of the reduction operation, is discussed. Experiments on transputer-based networks agree well with the theoretical results.
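As a rough illustration of the trade-off involved (our simplification, not the paper's exact model): organizing the p processors as a spanning tree of maximum degree d >= 3 yields depth about log_{d-1} p, and if a parent combines the values of its d-1 children one at a time, a single reduction costs roughly

```latex
T(p,d) \;\approx\; \bigl\lceil \log_{d-1} p \bigr\rceil \,(d-1)\,(\beta + \gamma)
```

where beta is the cost of one message hop and gamma the cost of one combining operation; the measured beta/gamma ratio then determines whether a flatter (large d) or deeper (small d) configuration minimizes T.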
{"title":"Reduction operations on a distributed memory machine with a reconfigurable interconnection network","authors":"S. Miguet, Yves Robert","doi":"10.1109/DMCC.1991.633354","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633354","url":null,"abstract":"Performing reduction operations with distributed memory machines whose interconnection networks are reconfigurable is considered. The focus is on machines whose interconnection graph can be configured as any graph of maximum degree d. The best way of interconnecting the p processors as a function of p,d and some problem- and machine-dependent parameters that characterize the ratio communication/arithmetic for the reduction operation are discussed. Experiments on transputer-based networks are in good accordance with the theoretical results. >","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125722372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
A Real-Time Parallel Algorithm Animation System
Pub Date: 1991-04-28 | DOI: 10.1109/DMCC.1991.633314
E. Williams, G. B. Lamont
Algorithm animation is a visualization method used to enhance understanding of the functioning of an algorithm or program. Visualization is used for many purposes, including education, algorithm research, performance analysis, and program debugging. This research applies algorithm animation techniques to programs developed for parallel architectures, with specific emphasis on the Intel iPSC/2 hypercube. Current investigations focus on two different areas: performance data display and animations of specific algorithms or classes of algorithms. This research builds on these efforts to provide a system that is able to both display performance data from parallel programs and support the creation of animations for specific algorithms. There are three goals for this visualization system: data should be displayed as it is generated; the interface to the target program should be transparent, allowing the animation of existing programs; and the system must be flexible enough to animate any algorithm. The resulting system incorporates, integrates, and extends two systems: the AFIT Algorithm Animation Research Facility (AAARF) and the Parallel Resource Analysis Software Environment (PRASE). Since performance data is an essential part of analyzing any parallel program, multiple views of the performance data are provided as an elementary part of the system. In addition to the animation system, a method for developing the animations is discussed. This method is applicable to animating any type of program, sequential or parallel. While both P-time and NP-time algorithms can potentially benefit from using visualization techniques, the set of NP-complete problems provides fertile ground for developing parallel applications. The methods discussed in this paper were used to animate a parallel implementation of a general Set Covering Problem (SCP).
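The transparency goal suggests an event interface along these lines: the instrumented program pushes trace events through a narrow hook, and display modules consume them as they arrive. All names below are ours, for illustration; the actual AAARF/PRASE interfaces may differ:

```cpp
// Our illustration of the kind of narrow, transparent hook the three goals
// imply; the actual AAARF/PRASE interfaces may differ. The program emits
// events as they occur, and display modules consume them immediately.
#include <cstdint>
#include <functional>
#include <utility>
#include <vector>

struct TraceEvent {
    std::uint64_t timestamp;
    int node;  // which hypercube node produced the event
    int kind;  // send, receive, state change, ...
};

class EventBus {
public:
    void subscribe(std::function<void(const TraceEvent&)> view) {
        views_.push_back(std::move(view));
    }
    void emit(const TraceEvent& e) {  // called as the data is generated
        for (auto& view : views_) view(e);
    }
private:
    std::vector<std::function<void(const TraceEvent&)>> views_;
};
```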
{"title":"A Real-Time Parallel Algorithm Animation System","authors":"E. Williams, G.B. Lament","doi":"10.1109/DMCC.1991.633314","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633314","url":null,"abstract":"Algorithm animation is a visualization method used to enhance understanding of the functioning of an algorithm or program. Visualization is used for many purposes, including education, algorithm research, performance analysis, and program debugging. This research applies algorithm animation techniques to programs developed for parallel architectures, with specific emphasis on the Intel iPSC/2 hypercube. Current, investigations focus in two different areas: performance data display and animations of specific algorithms or classes of algorithms. This research builds on these efforts to provide a system that is able to both display performance data from parallel programs and support the creation of animations for specific algorithms. There are three goals for this visualization system. Data should be displayed as it is generated. The inteiface to the target program should be transparent, allowing the animation of existing programs. The system must be flexible enough to animate any algorithm. The resulting system incorporates, integrates and extends two systems: the AFIT Algorithm Animation Research Facility (AAARF) and the Parallel Resource Analysis Software Environment (PRASE). Since performance data is an essential part of analyzing any parallel program, multiple views of the performance data are provided as an elementary part of the system. In addition to the animation system, a method for developing the animations is discussed. This method is arpplicable to animating any type of program, sequential or parallel. Whilc: both P-time and NP-ttme algorithms can potentially benefit from using visualization techniques, the set of NP .complete problems provides fertile ground for developing parallel atpplications. The methods discussed in this paper were used to animate a parallel implementation of a general Set Covering Problem (SCP).","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131952211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Structured Decompositions for Solving Sparse Nonlinear Systems of Equations on Parallel Computers
Pub Date: 1991-04-28 | DOI: 10.1109/DMCC.1991.633304
Xiaodong Zhang
Structured decompositions for solving sparse nonlinear systems of equations transform the sparse systems into special structures so that the computations can be decomposed efficiently for parallel processing. A group of nonlinear systems structured by such decompositions, and parallel methods for their solution, are overviewed.
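One common special structure of this kind (our illustration; the paper surveys several) is a bordered block-diagonal Newton system, in which every diagonal block can be factored in parallel and only the border couples the subproblems:

```latex
J\,\Delta x = -F(x),
\qquad
J = \begin{pmatrix}
A_1 &     &        & B_1 \\
    & A_2 &        & B_2 \\
    &     & \ddots & \vdots \\
C_1 & C_2 & \cdots & D
\end{pmatrix}
```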
{"title":"Structured Decompositions for Solving Sparse Nonlinear Systems of Equations on Parallel Computers","authors":"Xiaodong Zhang","doi":"10.1109/DMCC.1991.633304","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633304","url":null,"abstract":"The structured decompositions for solving sparse nonlinear systems of equations are to transform the sparse systems into some special structures so that the computations can be decomposed efficiently for parallel processing. A group of structured nonlinear systems by decoinpositions and their parallel methods for solutions are overviewed.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132216992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Effective Storage and Communication Schemes for Implementation of the Conjugate Gradient Method on an Intel iPSC/860
Pub Date: 1991-04-28 | DOI: 10.1109/DMCC.1991.633309
D. Anderson, L. Sattler
The conjugate gradient method for solving the system of linear equations arising during a finite element analysis has gained renewed interest with the advent of distributed memory computers. This paper describes a method that minimizes storage by taking advantage of symmetry and sparsity, and minimizes communication overhead by using asynchronous message passing. The data structure necessary to implement this procedure follows naturally from the finite element mesh. Test results show near-linear speedup for a sufficiently large matrix.
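A sketch of the storage side of such a scheme, assuming each node keeps its rows of the matrix in compressed sparse row form; the asynchronous overlap then amounts to computing the interior rows between posting and completing the boundary exchange (generic C++, not the iPSC/860 message-passing calls):

```cpp
// Generic C++ sketch, not the iPSC/860 calls: each node stores only its
// rows of A in compressed sparse row form. Overlap is obtained by computing
// interior rows (which need no remote x entries) between posting and
// completing the asynchronous boundary exchange.
#include <vector>

struct CsrMatrix {
    std::vector<int>    rowPtr;  // size = local rows + 1
    std::vector<int>    col;     // column index of each stored entry
    std::vector<double> val;     // value of each stored entry
};

// y[i] = sum_k A(i, k) * x[k] for local rows in [rowBegin, rowEnd).
void spmvRows(const CsrMatrix& A, const std::vector<double>& x,
              std::vector<double>& y, int rowBegin, int rowEnd) {
    for (int i = rowBegin; i < rowEnd; ++i) {
        double s = 0.0;
        for (int k = A.rowPtr[i]; k < A.rowPtr[i + 1]; ++k)
            s += A.val[k] * x[A.col[k]];
        y[i] = s;
    }
}
```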
{"title":"Effective Storage and Communication Schemes for Implementation of the Conjugate Gradient Method on an Intel iPSC/860","authors":"D. Anderson, L. Sattler","doi":"10.1109/DMCC.1991.633309","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633309","url":null,"abstract":"The conjugate gradient method for solving the system of linear equations arising during a finite element analysis has gained renewed interest with the advent of distributed memory computers. In this paper a method will be described which minimizes storage by taking advantage of symmetry and sparsity and minimizes communication overhead by using asynchronous message passing. The data structure necessary to implement this procedure follows naturally frotn the finite element mesh. Test results show near linear speedup for a suflciently large matrix.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133894358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}