首页 > 最新文献

The Sixth Distributed Memory Computing Conference, 1991. Proceedings最新文献

英文 中文
Scalable Performance Environments for Parallel Systems 并行系统的可扩展性能环境
Pub Date : 1991-10-18 DOI: 10.1109/DMCC.1991.633315
Daniel A. Reed, R. D. Olson, R. Aydt, Tara M. Madhyastha, T. Birkett, David W. Jensen, B. Nazief, B. K. Totty
As parallel systems expand in size and complexity, the absence of performance tools for these parallel systems exacerbates the already difficult problems of application program and system software performance tuning. Moreover, given the pace of technological change, we can no longer afford to develop ad hoc, one-of-a-kind performance instrumentation software; we need scalable, portable performance analysis tools. We describe an environment prototype based on the lessons learned from two previous generations of performance data analysis software. Our environment prototype contains a set of performance data transformation modules that can be interconnected in user-specified ways. It is the responsibility of the environment infrastructure to hide details of module interconnection and data sharing. The environment is written in C++ with the graphical displays based on X windows and the Motif toolkit. It allows users to interconnect and configure modules graphically to form an acyclic, directed data analysis graph. Performance trace data are represented in a self-documenting stream format that includes internal definitions of data types, sizes, and names. The environment prototype supports the use of head-mounted displays and sonic data presentation in addition to the traditional use of visual techniques.
随着并行系统在规模和复杂性上的扩展,这些并行系统的性能工具的缺乏加剧了应用程序和系统软件性能调优的困难问题。此外,考虑到技术变革的步伐,我们再也负担不起开发专门的、独一无二的性能仪表软件;我们需要可扩展的、便携的性能分析工具。我们根据从前两代性能数据分析软件中吸取的经验教训描述了一个环境原型。我们的环境原型包含一组性能数据转换模块,这些模块可以以用户指定的方式相互连接。隐藏模块互连和数据共享的细节是环境基础设施的责任。环境是用c++编写的,具有基于X窗口的图形显示和Motif工具包。它允许用户以图形方式互连和配置模块,以形成无循环、有向的数据分析图。性能跟踪数据以自文档流格式表示,该格式包括数据类型、大小和名称的内部定义。除了传统的视觉技术外,环境原型还支持使用头戴式显示器和声音数据呈现。
{"title":"Scalable Performance Environments for Parallel Systems","authors":"Daniel A. Reed, R. D. Olson, R. Aydt, Tara M. Madhyastha, T. Birkett, David W. Jensen, B. Nazief, B. K. Totty","doi":"10.1109/DMCC.1991.633315","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633315","url":null,"abstract":"As parallel systems expand in size and complexity, the absence of performance tools for these parallel systems exacerbates the already difficult problems of application program and system software performance tuning. Moreover, given the pace of technological change, we can no longer afford to develop ad hoc, one-of-a-kind performance instrumentation software; we need scalable, portable performance analysis tools. We describe an environment prototype based on the lessons learned from two previous generations of performance data analysis software. Our environment prototype contains a set of performance data transformation modules that can be interconnected in user-specified ways. It is the responsibility of the environment infrastructure to hide details of module interconnection and data sharing. The environment is written in C++ with the graphical displays based on X windows and the Motif toolkit. It allows users to interconnect and configure modules graphically to form an acyclic, directed data analysis graph. Performance trace data are represented in a self-documenting stream format that includes internal definitions of data types, sizes, and names. The environment prototype supports the use of head-mounted displays and sonic data presentation in addition to the traditional use of visual techniques.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122370339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 111
Design and Evaluation of Communication Processors supporting Message Passing in Distributed Memory Systems 分布式存储系统中支持消息传递的通信处理器的设计与评价
Pub Date : 1991-04-28 DOI: 10.1109/DMCC.1991.633352
H. Corporaal, J. Olk
The number of design decisions for connecting processor nodes within MIMD systems is rather large. This paper systematically introduces the most important design parameters for communication processors in MIMD systems. Together, these parameters span a multidimensional design space. Points in this space are clarijied through classijication of a number of existing communication processors. The design choices made for these processors are reviewed and their performance is evaluated. Suitable choices of the design parameters are highly influenced by application behavior. Ideally one would like to design processors which cover a whole area in this design space. A companion paper describes a scalable andflexible design currently being realized at our laboratory.
在MIMD系统中连接处理器节点的设计决策数量相当大。本文系统地介绍了MIMD系统中通信处理器的重要设计参数。总之,这些参数跨越了一个多维设计空间。通过对一些现有通信处理器的分类,澄清了这个空间中的要点。对这些处理器的设计选择进行了回顾,并对其性能进行了评估。应用行为对设计参数的选择有很大影响。理想情况下,我们希望设计的处理器能够覆盖这个设计空间的整个区域。另一篇论文描述了我们实验室目前正在实现的一种可扩展且灵活的设计。
{"title":"Design and Evaluation of Communication Processors supporting Message Passing in Distributed Memory Systems","authors":"H. Corporaal, J. Olk","doi":"10.1109/DMCC.1991.633352","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633352","url":null,"abstract":"The number of design decisions for connecting processor nodes within MIMD systems is rather large. This paper systematically introduces the most important design parameters for communication processors in MIMD systems. Together, these parameters span a multidimensional design space. Points in this space are clarijied through classijication of a number of existing communication processors. The design choices made for these processors are reviewed and their performance is evaluated. Suitable choices of the design parameters are highly influenced by application behavior. Ideally one would like to design processors which cover a whole area in this design space. A companion paper describes a scalable andflexible design currently being realized at our laboratory.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"120 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115183275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Temporal Communication Graphs: A New Graph Theoretic Model Mapping and Scheduling in Distributed Memory Systems 时间通信图:一种新的图论模型在分布式存储系统中的映射和调度
Pub Date : 1991-04-28 DOI: 10.1109/DMCC.1991.633137
V. Lo
The temporal communication graph is a new graph theoretic model of parallel computation that we have developed for the mapping of parallel computations to message-passing parallel architectures. The TCG integrates the two dominant models currently in use in the area of mapping, task assignment, partitioning, and scheduling: the static task graph and the DAG. The TCG augments these models with the capability to identify logically synchronous phases of communication and computation, and to describe the temporal behavior of a parallel algorithm in terms of these phases. This paper defines the TCG, illustrates its use for mapping and scheduling, and discusses a wide range of potentials uses for the TCG in the area of parallel programming environments.
时序通信图是一种新的并行计算图论模型,是我们为将并行计算映射到消息传递并行体系结构而开发的。TCG集成了目前在映射、任务分配、分区和调度领域使用的两种主要模型:静态任务图和DAG。TCG增强了这些模型,使其能够识别通信和计算的逻辑同步阶段,并根据这些阶段描述并行算法的时间行为。本文定义了TCG,说明了它在映射和调度方面的用途,并讨论了TCG在并行编程环境领域的广泛潜在用途。
{"title":"Temporal Communication Graphs: A New Graph Theoretic Model Mapping and Scheduling in Distributed Memory Systems","authors":"V. Lo","doi":"10.1109/DMCC.1991.633137","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633137","url":null,"abstract":"The temporal communication graph is a new graph theoretic model of parallel computation that we have developed for the mapping of parallel computations to message-passing parallel architectures. The TCG integrates the two dominant models currently in use in the area of mapping, task assignment, partitioning, and scheduling: the static task graph and the DAG. The TCG augments these models with the capability to identify logically synchronous phases of communication and computation, and to describe the temporal behavior of a parallel algorithm in terms of these phases. This paper defines the TCG, illustrates its use for mapping and scheduling, and discusses a wide range of potentials uses for the TCG in the area of parallel programming environments.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122756296","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Apply: A Parallel Compiler on iWarp for Image-Processing Applications 应用:一个并行编译器上的iWarp图像处理应用程序
Pub Date : 1991-04-28 DOI: 10.1109/DMCC.1991.633119
B. Baxter, B. Greer
Local operator compurafions used in 2-dimensional image processing can be applied TO individual pixels independenr­ ly. making ir easy ro perform This class ofproblems in paral­ lel. The Apply language is designed to exploir rhis OppOrlU­ niry while hiding mosr parallel programming dewilsfrom the programmer andrewining rhe lookandfeel ofa conven­ fional sequential style.
在二维图像处理中使用的局部算子计算可以独立地应用于单个像素。使这类问题的并行求解变得容易。Apply语言的设计目的是利用这种机会,同时对程序员隐藏更多的并行编程细节,并保留传统顺序风格的外观和感觉。
{"title":"Apply: A Parallel Compiler on iWarp for Image-Processing Applications","authors":"B. Baxter, B. Greer","doi":"10.1109/DMCC.1991.633119","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633119","url":null,"abstract":"Local operator compurafions used in 2-dimensional image processing can be applied TO individual pixels independenr­ ly. making ir easy ro perform This class ofproblems in paral­ lel. The Apply language is designed to exploir rhis OppOrlU­ niry while hiding mosr parallel programming dewilsfrom the programmer andrewining rhe lookandfeel ofa conven­ fional sequential style.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"89 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121214046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Using Spanning-Trees for Balancing Dynamic Load on Multiprocessors 用生成树来平衡多处理器的动态负载
Pub Date : 1991-04-28 DOI: 10.1109/DMCC.1991.633134
R. Melhem, K. Pruhs, T. Znati
We consider the problem of load balancing to minimize the cost of dynamic computations, including the cost of migrations. We analyze the costs associated with diffusion based algorithms for several common architectures. We introduce the Ripple load balancing paradigm, which has several advantages over diffusion methods, including flexibility and faster convergence.
我们考虑负载平衡问题,以最小化动态计算的成本,包括迁移的成本。我们分析了几种常见架构下基于扩散的算法的相关成本。我们介绍了Ripple负载平衡范式,它比扩散方法有几个优点,包括灵活性和更快的收敛性。
{"title":"Using Spanning-Trees for Balancing Dynamic Load on Multiprocessors","authors":"R. Melhem, K. Pruhs, T. Znati","doi":"10.1109/DMCC.1991.633134","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633134","url":null,"abstract":"We consider the problem of load balancing to minimize the cost of dynamic computations, including the cost of migrations. We analyze the costs associated with diffusion based algorithms for several common architectures. We introduce the Ripple load balancing paradigm, which has several advantages over diffusion methods, including flexibility and faster convergence.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"54 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114073030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Executing Synchronous Data Flow Graphs on Multicomputers 在多台计算机上执行同步数据流程图
Pub Date : 1991-04-28 DOI: 10.1109/DMCC.1991.633128
Zhiwei Xu
This paper presents a technique for programming distributed memory multicomputers by automatically generating parallel programs from parallel computations specified as synchronous data flow graphs or recurrence equations.
{"title":"Executing Synchronous Data Flow Graphs on Multicomputers","authors":"Zhiwei Xu","doi":"10.1109/DMCC.1991.633128","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633128","url":null,"abstract":"This paper presents a technique for programming distributed memory multicomputers by automatically generating parallel programs from parallel computations specified as synchronous data flow graphs or recurrence equations.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126469715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Reduction operations on a distributed memory machine with a reconfigurable interconnection network 具有可重构互连网络的分布式存储器机器上的约简操作
Pub Date : 1991-04-28 DOI: 10.1109/DMCC.1991.633354
S. Miguet, Yves Robert
Performing reduction operations with distributed memory machines whose interconnection networks are reconfigurable is considered. The focus is on machines whose interconnection graph can be configured as any graph of maximum degree d. The best way of interconnecting the p processors as a function of p,d and some problem- and machine-dependent parameters that characterize the ratio communication/arithmetic for the reduction operation are discussed. Experiments on transputer-based networks are in good accordance with the theoretical results. >
考虑在互连网络可重构的分布式存储机上进行约简运算。重点是那些互连图可以配置为最大度d的任何图的机器。作为p,d和表征约简操作的比率通信/算法的一些问题和机器相关参数的函数,讨论了互连p个处理器的最佳方法。在基于传输器的网络上的实验结果与理论结果吻合良好。>
{"title":"Reduction operations on a distributed memory machine with a reconfigurable interconnection network","authors":"S. Miguet, Yves Robert","doi":"10.1109/DMCC.1991.633354","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633354","url":null,"abstract":"Performing reduction operations with distributed memory machines whose interconnection networks are reconfigurable is considered. The focus is on machines whose interconnection graph can be configured as any graph of maximum degree d. The best way of interconnecting the p processors as a function of p,d and some problem- and machine-dependent parameters that characterize the ratio communication/arithmetic for the reduction operation are discussed. Experiments on transputer-based networks are in good accordance with the theoretical results. >","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125722372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A Real-Time Parallel Algorithm Animation System 一个实时并行算法动画系统
Pub Date : 1991-04-28 DOI: 10.1109/DMCC.1991.633314
E. Williams, G.B. Lament
Algorithm animation is a visualization method used to enhance understanding of the functioning of an algorithm or program. Visualization is used for many purposes, including education, algorithm research, performance analysis, and program debugging. This research applies algorithm animation techniques to programs developed for parallel architectures, with specific emphasis on the Intel iPSC/2 hypercube. Current, investigations focus in two different areas: performance data display and animations of specific algorithms or classes of algorithms. This research builds on these efforts to provide a system that is able to both display performance data from parallel programs and support the creation of animations for specific algorithms. There are three goals for this visualization system. Data should be displayed as it is generated. The inteiface to the target program should be transparent, allowing the animation of existing programs. The system must be flexible enough to animate any algorithm. The resulting system incorporates, integrates and extends two systems: the AFIT Algorithm Animation Research Facility (AAARF) and the Parallel Resource Analysis Software Environment (PRASE). Since performance data is an essential part of analyzing any parallel program, multiple views of the performance data are provided as an elementary part of the system. In addition to the animation system, a method for developing the animations is discussed. This method is arpplicable to animating any type of program, sequential or parallel. Whilc: both P-time and NP-ttme algorithms can potentially benefit from using visualization techniques, the set of NP .complete problems provides fertile ground for developing parallel atpplications. The methods discussed in this paper were used to animate a parallel implementation of a general Set Covering Problem (SCP).
算法动画是一种可视化方法,用于增强对算法或程序功能的理解。可视化有很多用途,包括教育、算法研究、性能分析和程序调试。本研究将算法动画技术应用于为并行架构开发的程序,特别强调了英特尔iPSC/2超立方体。目前,研究集中在两个不同的领域:性能数据显示和特定算法或算法类的动画。本研究建立在这些努力的基础上,以提供一个能够显示并行程序的性能数据并支持为特定算法创建动画的系统。这个可视化系统有三个目标。数据应该在生成时显示。目标程序的接口应该是透明的,允许现有程序的动画化。该系统必须足够灵活,以激活任何算法。由此产生的系统包含、集成和扩展了两个系统:AFIT算法动画研究设施(AAARF)和并行资源分析软件环境(PRASE)。由于性能数据是分析任何并行程序的基本部分,因此提供了性能数据的多个视图作为系统的基本部分。除了动画系统外,还讨论了动画的开发方法。这种方法适用于动画的任何类型的程序,顺序或并行。虽然P-time和NP- time算法都可以从使用可视化技术中获益,但NP完全问题集为开发并行应用程序提供了肥沃的土壤。本文所讨论的方法被应用于一般集合覆盖问题(SCP)的并行实现。
{"title":"A Real-Time Parallel Algorithm Animation System","authors":"E. Williams, G.B. Lament","doi":"10.1109/DMCC.1991.633314","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633314","url":null,"abstract":"Algorithm animation is a visualization method used to enhance understanding of the functioning of an algorithm or program. Visualization is used for many purposes, including education, algorithm research, performance analysis, and program debugging. This research applies algorithm animation techniques to programs developed for parallel architectures, with specific emphasis on the Intel iPSC/2 hypercube. Current, investigations focus in two different areas: performance data display and animations of specific algorithms or classes of algorithms. This research builds on these efforts to provide a system that is able to both display performance data from parallel programs and support the creation of animations for specific algorithms. There are three goals for this visualization system. Data should be displayed as it is generated. The inteiface to the target program should be transparent, allowing the animation of existing programs. The system must be flexible enough to animate any algorithm. The resulting system incorporates, integrates and extends two systems: the AFIT Algorithm Animation Research Facility (AAARF) and the Parallel Resource Analysis Software Environment (PRASE). Since performance data is an essential part of analyzing any parallel program, multiple views of the performance data are provided as an elementary part of the system. In addition to the animation system, a method for developing the animations is discussed. This method is arpplicable to animating any type of program, sequential or parallel. Whilc: both P-time and NP-ttme algorithms can potentially benefit from using visualization techniques, the set of NP .complete problems provides fertile ground for developing parallel atpplications. The methods discussed in this paper were used to animate a parallel implementation of a general Set Covering Problem (SCP).","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131952211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Structured Decompositions for Solving Sparse Nonlinear Systems of Equations on Parallel Computers 并行计算机上求解稀疏非线性方程组的结构化分解
Pub Date : 1991-04-28 DOI: 10.1109/DMCC.1991.633304
Xiaodong Zhang
The structured decompositions for solving sparse nonlinear systems of equations are to transform the sparse systems into some special structures so that the computations can be decomposed efficiently for parallel processing. A group of structured nonlinear systems by decoinpositions and their parallel methods for solutions are overviewed.
求解稀疏非线性方程组的结构化分解是将稀疏方程组转化为特定的结构,使计算能够有效地分解,便于并行处理。本文综述了一组结构化非线性系统的分解及其并行求解方法。
{"title":"Structured Decompositions for Solving Sparse Nonlinear Systems of Equations on Parallel Computers","authors":"Xiaodong Zhang","doi":"10.1109/DMCC.1991.633304","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633304","url":null,"abstract":"The structured decompositions for solving sparse nonlinear systems of equations are to transform the sparse systems into some special structures so that the computations can be decomposed efficiently for parallel processing. A group of structured nonlinear systems by decoinpositions and their parallel methods for solutions are overviewed.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"42 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132216992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Effective Storage and Communication Schemes for Implementation of the Conjugate Gradient Method on an Intel iPSC/860 在Intel iPSC/860上实现共轭梯度法的有效存储和通信方案
Pub Date : 1991-04-28 DOI: 10.1109/DMCC.1991.633309
D. Anderson, L. Sattler
The conjugate gradient method for solving the system of linear equations arising during a finite element analysis has gained renewed interest with the advent of distributed memory computers. In this paper a method will be described which minimizes storage by taking advantage of symmetry and sparsity and minimizes communication overhead by using asynchronous message passing. The data structure necessary to implement this procedure follows naturally frotn the finite element mesh. Test results show near linear speedup for a suflciently large matrix.
随着分布式存储计算机的出现,求解有限元分析中出现的线性方程组的共轭梯度法重新引起了人们的兴趣。本文将描述一种利用对称和稀疏性最小化存储和使用异步消息传递最小化通信开销的方法。实现此过程所需的数据结构自然遵循有限元网格。测试结果表明,对于足够大的矩阵,该算法的加速接近线性。
{"title":"Effective Storage and Communication Schemes for Implementation of the Conjugate Gradient Method on an Intel iPSC/860","authors":"D. Anderson, L. Sattler","doi":"10.1109/DMCC.1991.633309","DOIUrl":"https://doi.org/10.1109/DMCC.1991.633309","url":null,"abstract":"The conjugate gradient method for solving the system of linear equations arising during a finite element analysis has gained renewed interest with the advent of distributed memory computers. In this paper a method will be described which minimizes storage by taking advantage of symmetry and sparsity and minimizes communication overhead by using asynchronous message passing. The data structure necessary to implement this procedure follows naturally frotn the finite element mesh. Test results show near linear speedup for a suflciently large matrix.","PeriodicalId":313314,"journal":{"name":"The Sixth Distributed Memory Computing Conference, 1991. Proceedings","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1991-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133894358","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
The Sixth Distributed Memory Computing Conference, 1991. Proceedings
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1