首页 > 最新文献

Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing最新文献

英文 中文
2-D wavelet packet decomposition on multicomputers 多机二维小波包分解
Pub Date : 2000-01-19 DOI: 10.1109/EMPDP.2000.823430
M. Feil, A. Uhl
In this work we describe and analyze algorithms for 2-D wavelet packet decomposition for MIMD distributed memory architectures. The main goal is the generalization of former parallel WP algorithms which are constrained to a number of processor elements equal to a power of 4. We discuss several optimizations and generalizations of data parallel message passing algorithms and finally compare the results obtained on a Cray T3D.
在这项工作中,我们描述和分析了MIMD分布式存储体系结构的二维小波包分解算法。主要目标是推广以前的并行WP算法,这些算法被限制在等于4次幂的处理器元素的数量上。讨论了几种数据并行消息传递算法的优化和推广,最后比较了在Cray T3D上得到的结果。
{"title":"2-D wavelet packet decomposition on multicomputers","authors":"M. Feil, A. Uhl","doi":"10.1109/EMPDP.2000.823430","DOIUrl":"https://doi.org/10.1109/EMPDP.2000.823430","url":null,"abstract":"In this work we describe and analyze algorithms for 2-D wavelet packet decomposition for MIMD distributed memory architectures. The main goal is the generalization of former parallel WP algorithms which are constrained to a number of processor elements equal to a power of 4. We discuss several optimizations and generalizations of data parallel message passing algorithms and finally compare the results obtained on a Cray T3D.","PeriodicalId":128020,"journal":{"name":"Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116731240","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Specification-driven monitoring of TCP/IP 规范驱动的TCP/IP监控
Pub Date : 2000-01-19 DOI: 10.1109/EMPDP.2000.823387
R. Hofmann, F. Lemmen
Specification-driven monitoring is a novel technique for systematically analyzing the functional and temporal behavior of a system starting from specification to the implementation with the help of monitoring. This paper briefly shows the method and the tools belonging to it. The main part comprises a measurement study of a TCP/IP protocol stack fully specified in SDL. This study shows, how the TCP protocol stack was analyzed and improved in terms of correctness and performance. After correcting a difficult error in the runtime system, the throughput of the system improved by a factor of 10.
规范驱动的监控是一种借助监控系统地分析系统从规范到实现的功能和时间行为的新技术。本文简要介绍了该方法及其所使用的工具。主要部分是对SDL中完全指定的TCP/IP协议栈的测量研究。本研究展示了如何分析TCP协议栈,并在正确性和性能方面进行改进。在纠正了运行时系统中的一个严重错误后,系统的吞吐量提高了10倍。
{"title":"Specification-driven monitoring of TCP/IP","authors":"R. Hofmann, F. Lemmen","doi":"10.1109/EMPDP.2000.823387","DOIUrl":"https://doi.org/10.1109/EMPDP.2000.823387","url":null,"abstract":"Specification-driven monitoring is a novel technique for systematically analyzing the functional and temporal behavior of a system starting from specification to the implementation with the help of monitoring. This paper briefly shows the method and the tools belonging to it. The main part comprises a measurement study of a TCP/IP protocol stack fully specified in SDL. This study shows, how the TCP protocol stack was analyzed and improved in terms of correctness and performance. After correcting a difficult error in the runtime system, the throughput of the system improved by a factor of 10.","PeriodicalId":128020,"journal":{"name":"Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128634956","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Specification for reactive bulk-synchronous programming 响应式批量同步编程规范
Pub Date : 2000-01-19 DOI: 10.1109/EMPDP.2000.823411
Yifeng Chen
Extends bulk-synchronous programming (BSP) to incorporate reactive (i.e. non-terminating) programming. We propose a semantic model for BSP which allows a process to have infinitely many supersteps. The semantics reveals the essential difference between BSP and sequential specifications. Based on the model, a specification language, called the Super-Step Specification (SSS) language, is proposed to support modularised programming and to hide communication and synchronisation details in specifications. The notion of a public variable is proposed in order to substantially simplify reactive programming. The normal forms of BSP and SSS are identified, and complete sets of laws for the two languages are given. Finally, a few refinement laws are used to provide a BSP treatment of the dining philosophers problem, which illustrates the power of BSP reactive programming. Much of the formalism presented in this paper can also be applied to non-reactive programming.
扩展大容量同步编程(BSP),以合并响应式(即非终止)编程。我们提出了一个允许过程具有无限多个超步骤的BSP语义模型。语义揭示了BSP和顺序规范之间的本质区别。在该模型的基础上,提出了一种支持模块化编程和隐藏规范中通信和同步细节的规范语言,称为超级步骤规范(Super-Step specification, SSS)语言。公共变量的概念是为了大大简化响应式编程而提出的。确定了BSP和SSS的标准形式,并给出了两种语言的完整规则。最后,使用了一些改进定律来提供用餐哲学家问题的BSP处理,这说明了BSP反应性规划的强大功能。本文中提出的许多形式化方法也可以应用于非响应式编程。
{"title":"Specification for reactive bulk-synchronous programming","authors":"Yifeng Chen","doi":"10.1109/EMPDP.2000.823411","DOIUrl":"https://doi.org/10.1109/EMPDP.2000.823411","url":null,"abstract":"Extends bulk-synchronous programming (BSP) to incorporate reactive (i.e. non-terminating) programming. We propose a semantic model for BSP which allows a process to have infinitely many supersteps. The semantics reveals the essential difference between BSP and sequential specifications. Based on the model, a specification language, called the Super-Step Specification (SSS) language, is proposed to support modularised programming and to hide communication and synchronisation details in specifications. The notion of a public variable is proposed in order to substantially simplify reactive programming. The normal forms of BSP and SSS are identified, and complete sets of laws for the two languages are given. Finally, a few refinement laws are used to provide a BSP treatment of the dining philosophers problem, which illustrates the power of BSP reactive programming. Much of the formalism presented in this paper can also be applied to non-reactive programming.","PeriodicalId":128020,"journal":{"name":"Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing","volume":"115 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124029365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Monitoring and debugging message passing applications with MPVisualizer 监视和调试消息传递应用程序与MPVisualizer
Pub Date : 2000-01-19 DOI: 10.1109/EMPDP.2000.823433
A. Cláudio, J. D. Cunha, M. B. Carmo
MPVisualizer (Message Passing Visualizer) is a tool for the monitoring and debugging of message passing parallel applications with three components: the trace/replay mechanism, the graphical user interface and a central component, called visualization engine. The engine, which plays the main role during the replay phase, builds an object-oriented model of the application. Taking full advantage of inheritance and polymorphism the tool can be adapted to different message passing environments and different graphical environments, and easily reprogrammed to detect specific predicates. The engine is also prepared to recognize race conditions.
MPVisualizer (Message Passing Visualizer)是一个用于监视和调试消息传递并行应用程序的工具,它有三个组件:跟踪/重播机制、图形用户界面和一个称为可视化引擎的中心组件。该引擎在重播阶段起主要作用,它构建应用程序的面向对象模型。充分利用继承和多态性,该工具可以适应不同的消息传递环境和不同的图形化环境,并且可以轻松地重新编程以检测特定的谓词。发动机还准备识别比赛条件。
{"title":"Monitoring and debugging message passing applications with MPVisualizer","authors":"A. Cláudio, J. D. Cunha, M. B. Carmo","doi":"10.1109/EMPDP.2000.823433","DOIUrl":"https://doi.org/10.1109/EMPDP.2000.823433","url":null,"abstract":"MPVisualizer (Message Passing Visualizer) is a tool for the monitoring and debugging of message passing parallel applications with three components: the trace/replay mechanism, the graphical user interface and a central component, called visualization engine. The engine, which plays the main role during the replay phase, builds an object-oriented model of the application. Taking full advantage of inheritance and polymorphism the tool can be adapted to different message passing environments and different graphical environments, and easily reprogrammed to detect specific predicates. The engine is also prepared to recognize race conditions.","PeriodicalId":128020,"journal":{"name":"Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing","volume":"740 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127572102","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
Performance and transparency of message passing and DSM services within the GENESIS operating system for managing parallelism on COWs GENESIS操作系统中用于管理奶牛并行性的消息传递和DSM服务的性能和透明度
Pub Date : 2000-01-19 DOI: 10.1109/EMPDP.2000.823401
A. Goscinski, M. Hobbs, J. Silcock
The primary paradigm for building parallel applications for execution on clusters of workstations (COWs) can be generalised into message passing (MP) and distributed shared memory (DSM). Unfortunately the currently available run-time environments and operating systems do not provide satisfactory levels of transparency, management support, and only support either MP or DSM. We propose a unique and novel approach where the MP and DSM services are provided to the application programmer as a cohesive and comprehensive set of parallel processing servers that are integral components of an operating system. The performance of a number of common parallel applications, employing both MP (raw and PVM based) and DSM, demonstrate the high quality of the proposed approach.
构建用于在工作站集群(奶牛)上执行的并行应用程序的主要范例可以概括为消息传递(MP)和分布式共享内存(DSM)。不幸的是,目前可用的运行时环境和操作系统不能提供令人满意的透明度和管理支持,而且只支持MP或DSM。我们提出了一种独特而新颖的方法,将MP和DSM服务作为一组内聚和全面的并行处理服务器提供给应用程序程序员,这些服务器是操作系统的组成部分。使用MP(基于原始和PVM的)和DSM的许多常见并行应用程序的性能证明了所提出方法的高质量。
{"title":"Performance and transparency of message passing and DSM services within the GENESIS operating system for managing parallelism on COWs","authors":"A. Goscinski, M. Hobbs, J. Silcock","doi":"10.1109/EMPDP.2000.823401","DOIUrl":"https://doi.org/10.1109/EMPDP.2000.823401","url":null,"abstract":"The primary paradigm for building parallel applications for execution on clusters of workstations (COWs) can be generalised into message passing (MP) and distributed shared memory (DSM). Unfortunately the currently available run-time environments and operating systems do not provide satisfactory levels of transparency, management support, and only support either MP or DSM. We propose a unique and novel approach where the MP and DSM services are provided to the application programmer as a cohesive and comprehensive set of parallel processing servers that are integral components of an operating system. The performance of a number of common parallel applications, employing both MP (raw and PVM based) and DSM, demonstrate the high quality of the proposed approach.","PeriodicalId":128020,"journal":{"name":"Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124067411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
PQE HPF-a library for exploiting the capabilities of a PQE-1 heterogeneous parallel architecture PQE hpf -用于利用PQE-1异构并行体系结构的功能的库
Pub Date : 2000-01-19 DOI: 10.1109/EMPDP.2000.823408
R. Baraglia, R. Ferrini, D. Laforenza, P. Palmerini, R. Perego
Heterogeneous computing is a special form of parallel and distributed computing where computations are performed using a single autonomous computer operating in both SIMD and MIMD modes, or using a number of connected autonomous computers. In multimode system heterogeneous computing, tasks can be executed in both SIMD and MIMD simultaneously. In this paper, we present PQE HPF, a High Performance Fortran (HPF) based programming library which allows one to exploit the MIMD and SIMD capabilities offered by PQE-1, a multimode parallel architecture. Two different implementations of a well-known application, using HPF and PQE HPF respectively, were used to evaluate the overheads introduced over the machine's runtime system. Preliminary tests, conducted by running the case study application on the first PQE-1 prototype, show good results and encourage us to dedicate more effort to implement real production parallel codes on a similar architecture.
异构计算是并行和分布式计算的一种特殊形式,其中使用在SIMD和MIMD模式下运行的单个自主计算机或使用许多连接的自主计算机执行计算。在多模系统异构计算中,任务可以同时在SIMD和MIMD中执行。在本文中,我们提出了PQE HPF,一个基于高性能Fortran (HPF)的编程库,它允许人们利用PQE-1提供的多模式并行架构的MIMD和SIMD功能。一个知名应用程序的两种不同实现(分别使用HPF和PQE HPF)被用来评估在机器运行时系统上引入的开销。通过在第一个PQE-1原型上运行案例研究应用程序进行的初步测试显示出良好的结果,并鼓励我们投入更多的精力在类似架构上实现真正的生产并行代码。
{"title":"PQE HPF-a library for exploiting the capabilities of a PQE-1 heterogeneous parallel architecture","authors":"R. Baraglia, R. Ferrini, D. Laforenza, P. Palmerini, R. Perego","doi":"10.1109/EMPDP.2000.823408","DOIUrl":"https://doi.org/10.1109/EMPDP.2000.823408","url":null,"abstract":"Heterogeneous computing is a special form of parallel and distributed computing where computations are performed using a single autonomous computer operating in both SIMD and MIMD modes, or using a number of connected autonomous computers. In multimode system heterogeneous computing, tasks can be executed in both SIMD and MIMD simultaneously. In this paper, we present PQE HPF, a High Performance Fortran (HPF) based programming library which allows one to exploit the MIMD and SIMD capabilities offered by PQE-1, a multimode parallel architecture. Two different implementations of a well-known application, using HPF and PQE HPF respectively, were used to evaluate the overheads introduced over the machine's runtime system. Preliminary tests, conducted by running the case study application on the first PQE-1 prototype, show good results and encourage us to dedicate more effort to implement real production parallel codes on a similar architecture.","PeriodicalId":128020,"journal":{"name":"Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127989233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
First steps in metacomputing with Amica 与美国一起迈出元计算的第一步
Pub Date : 2000-01-19 DOI: 10.1109/EMPDP.2000.823412
T. Fink, S. Kindermann
The metacomputing system Amica is a new approach to support the development of coarse-grained applications for distributed dynamic heterogeneous systems (e.g. computers linked to the Internet). It aims at the location-transparent and convenient design of distributed applications and at the easy integration of legacy systems. Applications are described in the form of application graphs based on a predefined set of reusable components and connectors. This graph is dynamically interpreted using the Amica infrastructure. Amica provides uniform access to computational resources using the well-known factory pattern. Additionally, a memory subsystem supports the location-transparent use of complex data objects which may be replicated to increase access speed. To transfer data, specific network resources can be used. We report initial experiences with using Amica for a computationally intensive real-world problem: the parallel simulation of cellular mobile systems. Measurements show that Amica, even in its premature stage, provides a convenient interface and sufficient efficiency to build distributed applications utilizing heterogeneous dynamic resources.
元计算系统Amica是一种支持为分布式动态异构系统(例如连接到Internet的计算机)开发粗粒度应用程序的新方法。它的目标是分布式应用程序的位置透明和方便的设计,以及遗留系统的容易集成。应用程序以基于一组预定义的可重用组件和连接器的应用程序图的形式进行描述。此图使用美国基础设施进行动态解释。Amica使用众所周知的工厂模式提供对计算资源的统一访问。此外,内存子系统支持复杂数据对象的位置透明使用,这些对象可以被复制以提高访问速度。为了传输数据,可以使用特定的网络资源。我们报告了使用Amica解决计算密集型现实问题的初步经验:蜂窝移动系统的并行模拟。测量表明,即使在早期阶段,Amica也提供了一个方便的接口和足够的效率来构建利用异构动态资源的分布式应用程序。
{"title":"First steps in metacomputing with Amica","authors":"T. Fink, S. Kindermann","doi":"10.1109/EMPDP.2000.823412","DOIUrl":"https://doi.org/10.1109/EMPDP.2000.823412","url":null,"abstract":"The metacomputing system Amica is a new approach to support the development of coarse-grained applications for distributed dynamic heterogeneous systems (e.g. computers linked to the Internet). It aims at the location-transparent and convenient design of distributed applications and at the easy integration of legacy systems. Applications are described in the form of application graphs based on a predefined set of reusable components and connectors. This graph is dynamically interpreted using the Amica infrastructure. Amica provides uniform access to computational resources using the well-known factory pattern. Additionally, a memory subsystem supports the location-transparent use of complex data objects which may be replicated to increase access speed. To transfer data, specific network resources can be used. We report initial experiences with using Amica for a computationally intensive real-world problem: the parallel simulation of cellular mobile systems. Measurements show that Amica, even in its premature stage, provides a convenient interface and sufficient efficiency to build distributed applications utilizing heterogeneous dynamic resources.","PeriodicalId":128020,"journal":{"name":"Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing","volume":"73 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121209615","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Groups in bulk synchronous parallel computing 组在批量同步并行计算
Pub Date : 2000-01-19 DOI: 10.1109/EMPDP.2000.823418
J. González, C. León, F. Piccoli, A. M. Printista, J. R. García, C. Rodríguez, F. D. Sande
An extension to the Bulk Synchronous Parallel Model (BSP) to allow the use of asynchronous BSP groups of processors is presented. In this model, called Nested BSP, processor groups can be divided and processors in a group synchronize through group dependent collective operations generalizing the concept of barrier synchronization. A classification of problems and algorithms attending to their parallel input-output distribution is provided. For one of these problem classes, the called common-common class, we present a general strategy to derive efficient parallel algorithms. Algorithms belonging to this class allow the arbitrary division of the processor subsets, easing the opportunities of the underlying BSP software to divide the network in independent sub networks, minimizing the impact of the traffic in the rest of the network in the predicted cost. The expressiveness of the model is exemplified through three divide and conquer programs. The computational results for these programs in six high performance supercomputers show both the accuracy of the model and the optimality of the speedups for the class of problems considered.
提出了对批量同步并行模型(BSP)的一种扩展,允许使用异步BSP处理器组。在这个称为嵌套BSP的模型中,可以划分处理器组,并且组中的处理器通过组相关的集体操作进行同步,从而推广了屏障同步的概念。给出了涉及其并行输入输出分布的问题和算法的分类。对于其中的一类问题,我们提出了一种通用的策略来推导高效的并行算法。这类算法允许对处理器子集进行任意划分,使底层BSP软件能够将网络划分为独立的子网络,从而最大限度地减少网络中其余部分的流量对预测成本的影响。通过三个分而治之的程序来说明模型的表现力。这些程序在六台高性能超级计算机上的计算结果表明,对于所考虑的这类问题,模型是准确的,加速是最优的。
{"title":"Groups in bulk synchronous parallel computing","authors":"J. González, C. León, F. Piccoli, A. M. Printista, J. R. García, C. Rodríguez, F. D. Sande","doi":"10.1109/EMPDP.2000.823418","DOIUrl":"https://doi.org/10.1109/EMPDP.2000.823418","url":null,"abstract":"An extension to the Bulk Synchronous Parallel Model (BSP) to allow the use of asynchronous BSP groups of processors is presented. In this model, called Nested BSP, processor groups can be divided and processors in a group synchronize through group dependent collective operations generalizing the concept of barrier synchronization. A classification of problems and algorithms attending to their parallel input-output distribution is provided. For one of these problem classes, the called common-common class, we present a general strategy to derive efficient parallel algorithms. Algorithms belonging to this class allow the arbitrary division of the processor subsets, easing the opportunities of the underlying BSP software to divide the network in independent sub networks, minimizing the impact of the traffic in the rest of the network in the predicted cost. The expressiveness of the model is exemplified through three divide and conquer programs. The computational results for these programs in six high performance supercomputers show both the accuracy of the model and the optimality of the speedups for the class of problems considered.","PeriodicalId":128020,"journal":{"name":"Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129394391","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A performance simulation technique for distributed programs: application to an SOR iterative solver 分布式程序的性能模拟技术:在SOR迭代求解器中的应用
Pub Date : 2000-01-19 DOI: 10.1109/EMPDP.2000.823432
Rocco Aversa, B. D. Martino, N. Mazzocca, Umberto Villano
In this paper we present the application of an approach for the performance prediction of message passing programs, to a PVM code implementing an iterative solver based on the Successive OverRelaxation method. The approach, based on the integration of static program analysis and simulation techniques, is aimed at significantly speeding up the time needed for simulating the execution of a message passing program. We show how the proposed technique can provide, in a reasonable elaboration time, the user for a characterization of iterative regular programs as the proposed one, in terms of idle-, cpu-, communication and synchronization time in Heterogeneous and Network Computing environments.
在本文中,我们提出了一种用于消息传递程序性能预测的方法,用于实现基于逐次超松弛方法的迭代求解器的PVM代码。该方法基于静态程序分析和仿真技术的集成,旨在显著加快模拟消息传递程序执行所需的时间。我们展示了所提出的技术如何在合理的细化时间内,为用户提供迭代规则程序的表征,如所提出的程序,在异构和网络计算环境中的空闲时间、cpu时间、通信时间和同步时间。
{"title":"A performance simulation technique for distributed programs: application to an SOR iterative solver","authors":"Rocco Aversa, B. D. Martino, N. Mazzocca, Umberto Villano","doi":"10.1109/EMPDP.2000.823432","DOIUrl":"https://doi.org/10.1109/EMPDP.2000.823432","url":null,"abstract":"In this paper we present the application of an approach for the performance prediction of message passing programs, to a PVM code implementing an iterative solver based on the Successive OverRelaxation method. The approach, based on the integration of static program analysis and simulation techniques, is aimed at significantly speeding up the time needed for simulating the execution of a message passing program. We show how the proposed technique can provide, in a reasonable elaboration time, the user for a characterization of iterative regular programs as the proposed one, in terms of idle-, cpu-, communication and synchronization time in Heterogeneous and Network Computing environments.","PeriodicalId":128020,"journal":{"name":"Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117231177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalability analysis of parallel systems with multiple components of work 具有多个工作组件的并行系统的可伸缩性分析
Pub Date : 2000-01-19 DOI: 10.1109/EMPDP.2000.823410
E. Tambouris, P. V. Santen
The generic fixed-value efficiency (FVE) method is proposed in order to study the scalability of parallel algorithms with multiple components of work. The generic FVE method is based on the isoefficiency method. Unlike isoefficiency, however, this method may be applied to parallel algorithm-machine combinations (parallel systems) where the relationship between the total work and its components is not predetermined by the decomposition method or by any other factor. The objective of the method is to derive the relationships between the total work and its components in order for the efficiency of the parallel system to be preserved. The use of the method is demonstrated by analysing the impact of the sparsity of the input data on the scalability of a static state estimator for power systems.
为了研究多任务并行算法的可扩展性,提出了通用固定值效率方法。通用的FVE方法是在等效率方法的基础上提出的。然而,与等效率不同的是,这种方法可以应用于并行算法-机器组合(并行系统),其中总功与其分量之间的关系不是由分解方法或任何其他因素预先确定的。该方法的目的是推导出总功与其分量之间的关系,以保持并联系统的效率。通过分析输入数据的稀疏性对电力系统静态估计器可扩展性的影响,证明了该方法的应用。
{"title":"Scalability analysis of parallel systems with multiple components of work","authors":"E. Tambouris, P. V. Santen","doi":"10.1109/EMPDP.2000.823410","DOIUrl":"https://doi.org/10.1109/EMPDP.2000.823410","url":null,"abstract":"The generic fixed-value efficiency (FVE) method is proposed in order to study the scalability of parallel algorithms with multiple components of work. The generic FVE method is based on the isoefficiency method. Unlike isoefficiency, however, this method may be applied to parallel algorithm-machine combinations (parallel systems) where the relationship between the total work and its components is not predetermined by the decomposition method or by any other factor. The objective of the method is to derive the relationships between the total work and its components in order for the efficiency of the parallel system to be preserved. The use of the method is demonstrated by analysing the impact of the sparsity of the input data on the scalability of a static state estimator for power systems.","PeriodicalId":128020,"journal":{"name":"Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2000-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122657529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Proceedings 8th Euromicro Workshop on Parallel and Distributed Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1