首页 > 最新文献

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing最新文献

英文 中文
Cluster computing vs. Cray T3E-a case study from numerical field theory 集群计算vs. Cray t3e -一个来自数值场理论的案例研究
Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905077
G. Arnold, N. Eicker, T. Lippert, K. Schilling
We compare the performance of a simulation code for lattice quantum electrodynamics, running on the cluster computer ALiCE (Alpha-Linux-Cluster-Engine) and the Cray T3E-1200 system. We present results from simulations using the novel parallelized multicanonical hybrid Monte Carlo algorithm. We merge multicanonical simulation techniques with the hybrid Monte Carlo algorithm to achieve a parallel scheme, and thus to be able to fight the notorious metastabilities by use of high performance parallel computers. We demonstrate for this application field that ALiCE is superior to the Cray T3E-1200 by factors of about 1.3 to 2.
我们比较了运行在集群计算机ALiCE (Alpha-Linux-Cluster-Engine)和Cray T3E-1200系统上的晶格量子电动力学模拟代码的性能。我们给出了使用新的并行多范式混合蒙特卡罗算法的模拟结果。我们将多范式模拟技术与混合蒙特卡罗算法相结合,实现了一种并行方案,从而能够利用高性能并行计算机来对抗臭名昭著的亚稳态。在这个应用领域,我们证明ALiCE比Cray T3E-1200优越约1.3到2倍。
{"title":"Cluster computing vs. Cray T3E-a case study from numerical field theory","authors":"G. Arnold, N. Eicker, T. Lippert, K. Schilling","doi":"10.1109/EMPDP.2001.905077","DOIUrl":"https://doi.org/10.1109/EMPDP.2001.905077","url":null,"abstract":"We compare the performance of a simulation code for lattice quantum electrodynamics, running on the cluster computer ALiCE (Alpha-Linux-Cluster-Engine) and the Cray T3E-1200 system. We present results from simulations using the novel parallelized multicanonical hybrid Monte Carlo algorithm. We merge multicanonical simulation techniques with the hybrid Monte Carlo algorithm to achieve a parallel scheme, and thus to be able to fight the notorious metastabilities by use of high performance parallel computers. We demonstrate for this application field that ALiCE is superior to the Cray T3E-1200 by factors of about 1.3 to 2.","PeriodicalId":262971,"journal":{"name":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129646018","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Towards ubiquitous awareness: the PRAVTA prototype 走向无处不在的意识:PRAVTA原型
Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905036
Tom Gross
Despite huge progress in information and communication technology it is often difficult to spontaneously contact persons who are at other locations. This is often due to the fact that important information about the persons at the other sites is missing. We therefore argue that users need awareness-they need to know if the potential communication or cooperation partners are present in the system, if they are available, how busy they are, and so forth. Furthermore, users need this information independently of their current location and adapted to their current context. In this paper we introduce the concept of ubiquitous awareness-a concept that provides users with context-specific information about the presence, availability, and tasks of other persons anytime and anywhere and at the same time allows the system to capture information about users. We present the PRAVTA prototype that exemplifies an implementation of this concept.
尽管信息和通信技术取得了巨大进步,但往往很难自发地与在其他地点的人联系。这通常是由于有关其他地点人员的重要信息丢失。因此,我们认为用户需要意识——他们需要知道系统中是否存在潜在的通信或合作伙伴,他们是否可用,他们有多忙,等等。此外,用户需要独立于其当前位置并适应其当前上下文的这些信息。在本文中,我们介绍了泛在感知的概念——一个随时随地为用户提供关于其他人的存在、可用性和任务的上下文特定信息的概念,同时允许系统捕获关于用户的信息。我们提出的PRAVTA原型,举例说明了这一概念的实现。
{"title":"Towards ubiquitous awareness: the PRAVTA prototype","authors":"Tom Gross","doi":"10.1109/EMPDP.2001.905036","DOIUrl":"https://doi.org/10.1109/EMPDP.2001.905036","url":null,"abstract":"Despite huge progress in information and communication technology it is often difficult to spontaneously contact persons who are at other locations. This is often due to the fact that important information about the persons at the other sites is missing. We therefore argue that users need awareness-they need to know if the potential communication or cooperation partners are present in the system, if they are available, how busy they are, and so forth. Furthermore, users need this information independently of their current location and adapted to their current context. In this paper we introduce the concept of ubiquitous awareness-a concept that provides users with context-specific information about the presence, availability, and tasks of other persons anytime and anywhere and at the same time allows the system to capture information about users. We present the PRAVTA prototype that exemplifies an implementation of this concept.","PeriodicalId":262971,"journal":{"name":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","volume":"88 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127019792","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
A parallel compact genetic algorithm for multi-FPGA partitioning 多fpga分区的并行紧凑遗传算法
Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905033
R. Baraglia, R. Perego, J. Hidalgo, J. Lanchares, F. Tirado
In this paper we investigate the design of a compact genetic algorithm to solve multi-FPGA partitioning problems. Nowadays Multi-FPGA systems are used for a great variety of applications such as dynamically reconfigurable hardware applications, digital circuit emulation, and numerical computation. Both a sequential and a parallel version of a compact genetic algorithm (cGA) have been designed and implemented on a cluster of workstations. The peculiarities of the cGA permits to save memory in order to address large multi-FPGA partitioning problems, while the exploitation of parallelism allows to reduce execution times. The good results achieved on several experiments conducted on different multi-FPGA partitioning instances show that this solution is viable to solve multi-FPGA partitioning problems.
本文研究了一种紧凑的遗传算法来解决多fpga分区问题。目前,多fpga系统已广泛应用于硬件动态重构、数字电路仿真和数值计算等领域。设计并实现了一种紧凑遗传算法(cGA)的顺序和并行版本。cGA的特性允许节省内存以解决大型多fpga分区问题,同时利用并行性可以减少执行时间。在不同的多fpga分区实例上进行了多次实验,取得了良好的结果,表明该方案是解决多fpga分区问题的可行方案。
{"title":"A parallel compact genetic algorithm for multi-FPGA partitioning","authors":"R. Baraglia, R. Perego, J. Hidalgo, J. Lanchares, F. Tirado","doi":"10.1109/EMPDP.2001.905033","DOIUrl":"https://doi.org/10.1109/EMPDP.2001.905033","url":null,"abstract":"In this paper we investigate the design of a compact genetic algorithm to solve multi-FPGA partitioning problems. Nowadays Multi-FPGA systems are used for a great variety of applications such as dynamically reconfigurable hardware applications, digital circuit emulation, and numerical computation. Both a sequential and a parallel version of a compact genetic algorithm (cGA) have been designed and implemented on a cluster of workstations. The peculiarities of the cGA permits to save memory in order to address large multi-FPGA partitioning problems, while the exploitation of parallelism allows to reduce execution times. The good results achieved on several experiments conducted on different multi-FPGA partitioning instances show that this solution is viable to solve multi-FPGA partitioning problems.","PeriodicalId":262971,"journal":{"name":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","volume":"18 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131852441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
History-based weighted average voter: a novel software voting algorithm for fault-tolerant computer systems 基于历史的加权平均投票人:一种新的容错计算机系统软件投票算法
Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905068
G. L. Shabgahi, J. Bass, S. Bennett
Voting algorithms have been widely used in the realisation of fault-tolerant systems. We propose a new software voting algorithm which uses the history record of redundant modules to compute the final output. Two versions for the novel algorithm are introduced In the first version any module result is assigned to a weighting value such that module results with higher history record value, are assigned to a higher weighing value than those with lower history record value. In the second version of the novel voter, those module results which have a history record value, less than the average record value, are allocated a weight of zero and removed from the contribution toward the voter output. Furthermore, a novel method for creation of a history record of modules is proposed. Empirical results show that both versions of the novel voter give higher safety performance than the Standard Weighted Average voter with permanent and transient errors.
投票算法在容错系统的实现中有着广泛的应用。提出了一种利用冗余模块历史记录计算最终输出的软件投票算法。介绍了新算法的两个版本,在第一个版本中,任何模块结果都被赋予一个权重值,使得历史记录值较高的模块结果被赋予比历史记录值较低的模块结果更高的权重值。在新投票人的第二个版本中,那些历史记录值小于平均记录值的模块结果被分配为零权重,并从对投票人输出的贡献中删除。此外,还提出了一种创建模块历史记录的新方法。实证结果表明,两种版本的新投票人都比具有永久和瞬态误差的标准加权平均投票人具有更高的安全性能。
{"title":"History-based weighted average voter: a novel software voting algorithm for fault-tolerant computer systems","authors":"G. L. Shabgahi, J. Bass, S. Bennett","doi":"10.1109/EMPDP.2001.905068","DOIUrl":"https://doi.org/10.1109/EMPDP.2001.905068","url":null,"abstract":"Voting algorithms have been widely used in the realisation of fault-tolerant systems. We propose a new software voting algorithm which uses the history record of redundant modules to compute the final output. Two versions for the novel algorithm are introduced In the first version any module result is assigned to a weighting value such that module results with higher history record value, are assigned to a higher weighing value than those with lower history record value. In the second version of the novel voter, those module results which have a history record value, less than the average record value, are allocated a weight of zero and removed from the contribution toward the voter output. Furthermore, a novel method for creation of a history record of modules is proposed. Empirical results show that both versions of the novel voter give higher safety performance than the Standard Weighted Average voter with permanent and transient errors.","PeriodicalId":262971,"journal":{"name":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128459644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 48
Dynamic code management on a Java multicomputer Java多计算机上的动态代码管理
Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905025
P. Sage, P. Milligan, A. Bouridane
It is clear that writing software for parallel architectures is a non-trivial process. This has encouraged much research in an effort to provide tools to assist parallel software development. However, while these tools may cater for architecture-specific problems, they do little for the concept of parallel software engineering, as the end product is usually neither scaleable nor portable. The introduction of a level of abstraction in the expression of parallel algorithms can elevate the reasoning process above architectural constraints and assist the production of more flexible code. This paper outlines an object-oriented parallel algorithm development paradigm based on a task and channel notation, and examines the utilisation of Java TM technologies in the development of a distributed Java TM virtual machine architecture on which algorithms expressed in this notation may be executed dynamically.
很明显,为并行架构编写软件是一个非常重要的过程。这鼓励了许多研究,以努力提供辅助并行软件开发的工具。然而,尽管这些工具可以解决特定于体系结构的问题,但它们对并行软件工程的概念几乎没有帮助,因为最终产品通常既不可伸缩也不可移植。在并行算法的表达中引入抽象级别可以将推理过程提升到架构约束之上,并有助于生成更灵活的代码。本文概述了一个基于任务和通道表示法的面向对象并行算法开发范例,并研究了Java TM技术在分布式Java TM虚拟机架构开发中的应用,在该架构上,用这种表示法表达的算法可以动态执行。
{"title":"Dynamic code management on a Java multicomputer","authors":"P. Sage, P. Milligan, A. Bouridane","doi":"10.1109/EMPDP.2001.905025","DOIUrl":"https://doi.org/10.1109/EMPDP.2001.905025","url":null,"abstract":"It is clear that writing software for parallel architectures is a non-trivial process. This has encouraged much research in an effort to provide tools to assist parallel software development. However, while these tools may cater for architecture-specific problems, they do little for the concept of parallel software engineering, as the end product is usually neither scaleable nor portable. The introduction of a level of abstraction in the expression of parallel algorithms can elevate the reasoning process above architectural constraints and assist the production of more flexible code. This paper outlines an object-oriented parallel algorithm development paradigm based on a task and channel notation, and examines the utilisation of Java TM technologies in the development of a distributed Java TM virtual machine architecture on which algorithms expressed in this notation may be executed dynamically.","PeriodicalId":262971,"journal":{"name":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128744651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Regular d-neighbours topologies 规则的d邻居拓扑
Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905004
R. Trobec, Uros Jovanovic
Topological parameters and routing performances of regular d-neighbours topologies are analysed and discussed in this paper. Average and maximal distances between nodes are compared for hypercubes, 3D-tori, arrangement graphs and recently introduced d-meshes, an extended family of two-dimensional regular meshes of an arbitrary degree d and an isomorphic node neighbourhood. d-meshes are superior in node-to-node distances, in the expansion scalability and in the potential for a planar implementation.
对正则d邻拓扑的拓扑参数和路由性能进行了分析和讨论。比较了超立方体、3d环面、排列图和最近引入的d-网格(任意度d的二维规则网格的扩展家族和同构节点邻域)节点之间的平均和最大距离。d-mesh在节点到节点的距离、扩展的可扩展性和平面实现的潜力方面都具有优势。
{"title":"Regular d-neighbours topologies","authors":"R. Trobec, Uros Jovanovic","doi":"10.1109/EMPDP.2001.905004","DOIUrl":"https://doi.org/10.1109/EMPDP.2001.905004","url":null,"abstract":"Topological parameters and routing performances of regular d-neighbours topologies are analysed and discussed in this paper. Average and maximal distances between nodes are compared for hypercubes, 3D-tori, arrangement graphs and recently introduced d-meshes, an extended family of two-dimensional regular meshes of an arbitrary degree d and an isomorphic node neighbourhood. d-meshes are superior in node-to-node distances, in the expansion scalability and in the potential for a planar implementation.","PeriodicalId":262971,"journal":{"name":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116105888","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Modeling the behaviour of linear algebra algorithms with message-passing 用消息传递对线性代数算法的行为进行建模
Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905054
J. Cuenca, D. Giménez, José González
Modeling the behaviour of linear algebra algorithms is very suitable for designing linear algebra software for high performance computers. This modelization would enable us to predict the execution time of the routines depending on a number of parameters. There are two groups of parameters, in the first, there are the parameters whose values can be chosen by the user: number of processors, processors grid configuration, distribution of data in the system, block size; and in the second, we have the parameters that specify the characteristics of a target architecture: arithmetic cost and start-up and word-sending cost of a communication operation. Thus, a linear algebra library could be designed in such a way that each routine takes the values of the parameters of the first group that provide the expected optimum execution time, and solves the problem. This library could, therefore be employed by a non-expert user to solve scientific or engineering problems, because the user does not need to determine the values of these parameters. The design methodology is analysed with one-sided block Jacobi methods to solve the symmetric eigenvalue problem. Variants for a logical ring and a logical rectangular mesh of processors are considered. An analytical model of the algorithm is developed, and the behaviour of the algorithm is analysed with message-passing using MPI in a SGI Origin 2000. With the parameters chosen by our model, the execution time is reduced from about 50% higher than the optimal to just 2%.
对线性代数算法的行为进行建模非常适合设计用于高性能计算机的线性代数软件。这种建模将使我们能够根据一些参数预测例程的执行时间。有两组参数,第一组参数的值可由用户选择:处理器数量、处理器网格配置、数据在系统中的分布、块大小;在第二部分中,我们有指定目标体系结构特征的参数:通信操作的算术代价、启动和发送字的代价。因此,可以这样设计线性代数库:每个例程都取第一组提供预期最佳执行时间的参数值,并解决问题。因此,这个库可以被非专业用户用来解决科学或工程问题,因为用户不需要确定这些参数的值。用单侧块雅可比法分析了对称特征值问题的设计方法。考虑了逻辑环和逻辑矩形处理器网格的变体。建立了该算法的解析模型,并在SGI Origin 2000中使用MPI进行消息传递,分析了该算法的行为。使用我们的模型选择的参数,执行时间从比最优值高约50%减少到仅2%。
{"title":"Modeling the behaviour of linear algebra algorithms with message-passing","authors":"J. Cuenca, D. Giménez, José González","doi":"10.1109/EMPDP.2001.905054","DOIUrl":"https://doi.org/10.1109/EMPDP.2001.905054","url":null,"abstract":"Modeling the behaviour of linear algebra algorithms is very suitable for designing linear algebra software for high performance computers. This modelization would enable us to predict the execution time of the routines depending on a number of parameters. There are two groups of parameters, in the first, there are the parameters whose values can be chosen by the user: number of processors, processors grid configuration, distribution of data in the system, block size; and in the second, we have the parameters that specify the characteristics of a target architecture: arithmetic cost and start-up and word-sending cost of a communication operation. Thus, a linear algebra library could be designed in such a way that each routine takes the values of the parameters of the first group that provide the expected optimum execution time, and solves the problem. This library could, therefore be employed by a non-expert user to solve scientific or engineering problems, because the user does not need to determine the values of these parameters. The design methodology is analysed with one-sided block Jacobi methods to solve the symmetric eigenvalue problem. Variants for a logical ring and a logical rectangular mesh of processors are considered. An analytical model of the algorithm is developed, and the behaviour of the algorithm is analysed with message-passing using MPI in a SGI Origin 2000. With the parameters chosen by our model, the execution time is reduced from about 50% higher than the optimal to just 2%.","PeriodicalId":262971,"journal":{"name":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116772614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Message-passing computing with Java: performance evaluation and comparisons 使用Java的消息传递计算:性能评估和比较
Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905040
V. Getov, M. Williams, Quanming Lu, M. Thomas
The development of Java has seen increasing attention as the most popular platform for distributed computing. However, despite Java's advantages in the area of portability and rapid prototyping, its efficiency is unavoidably compromised through its commitment to portability. In this paper we present performance analysis and comparisons of evaluation results for both Java and C/Fortran on three different message-passing parallel platforms-a shared memory multi-processor (Sun E4000), a Linux cluster, and a distributed memory computer (IBM SP-2). The NAS Embarrassingly Parallel and Integer Sort benchmarks were selected for this evaluation. Both the original Fortran/C codes and Java versions of these two kernels were used for obtaining the performance measurements as part of our project. The evaluation results demonstrate the feasibility of message-passing computing with Java on a wide range of computer platforms. Depending on the system and the software components installed, significant impact on the message-passing performance will have the efficiency of the native MPI library and the version of the Java platform.
Java作为分布式计算最流行的平台,其发展受到了越来越多的关注。然而,尽管Java在可移植性和快速原型方面具有优势,但由于其对可移植性的承诺,其效率不可避免地受到损害。在本文中,我们对Java和C/Fortran在三种不同的消息传递并行平台(共享内存多处理器(Sun E4000)、Linux集群和分布式内存计算机(IBM SP-2))上的性能分析和评估结果进行了比较。NAS窘迫并行和整数排序基准被选择用于本次评估。作为我们项目的一部分,这两个内核的原始Fortran/C代码和Java版本都用于获得性能度量。评估结果表明,在广泛的计算机平台上使用Java进行消息传递计算是可行的。根据所安装的系统和软件组件的不同,本机MPI库的效率和Java平台的版本会对消息传递性能产生重大影响。
{"title":"Message-passing computing with Java: performance evaluation and comparisons","authors":"V. Getov, M. Williams, Quanming Lu, M. Thomas","doi":"10.1109/EMPDP.2001.905040","DOIUrl":"https://doi.org/10.1109/EMPDP.2001.905040","url":null,"abstract":"The development of Java has seen increasing attention as the most popular platform for distributed computing. However, despite Java's advantages in the area of portability and rapid prototyping, its efficiency is unavoidably compromised through its commitment to portability. In this paper we present performance analysis and comparisons of evaluation results for both Java and C/Fortran on three different message-passing parallel platforms-a shared memory multi-processor (Sun E4000), a Linux cluster, and a distributed memory computer (IBM SP-2). The NAS Embarrassingly Parallel and Integer Sort benchmarks were selected for this evaluation. Both the original Fortran/C codes and Java versions of these two kernels were used for obtaining the performance measurements as part of our project. The evaluation results demonstrate the feasibility of message-passing computing with Java on a wide range of computer platforms. Depending on the system and the software components installed, significant impact on the message-passing performance will have the efficiency of the native MPI library and the version of the Java platform.","PeriodicalId":262971,"journal":{"name":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130442282","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Probabilistic communication optimizations and parallelization for distributed-memory systems 分布式内存系统的概率通信优化和并行化
Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905042
E. Mehofer, Bernhard Scholz
In high-performance systems execution time is of crucial importance justifying advanced optimization techniques. Traditionally, optimization is based on static program analysis. The quality of program optimizations, however, can be substantially improved by utilizing runtime information. Probabilistic data-flow frameworks compute the probability with what data-flow facts may hold at some program point based on representative profile runs. Advanced optimizations can use this information in order to produce highly efficient code. In this paper we introduce a novel optimization technique in the context of High Performance Fortran (HPF) that is based on probabilistic data-flow information. We consider statically undefined attributes which play an important role for parallelization and compute for those attributes the probabilities to hold some specific value during runtime. For the most probable attribute values highly-optimized, specialized code is generated. In this way significantly better performance results can be achieved. The implementation of our optimization is done in the context of VFC, a source-to-source parallelizing compiler for HPF/F90.
在高性能系统中,执行时间是证明先进优化技术的关键。传统上,优化是基于静态程序分析的。然而,程序优化的质量可以通过利用运行时信息得到很大的提高。概率数据流框架根据代表性概要文件运行计算数据流事实在某个程序点上可能存在的概率。高级优化可以使用这些信息来生成高效的代码。本文介绍了一种基于概率数据流信息的高性能Fortran (HPF)环境下的新型优化技术。我们考虑对并行化起重要作用的静态未定义属性,并计算这些属性在运行时保持某个特定值的概率。对于高度优化的最可能的属性值,生成专门的代码。通过这种方式可以获得明显更好的性能结果。我们的优化实现是在VFC上下文中完成的,VFC是一个用于HPF/F90的源对源并行编译器。
{"title":"Probabilistic communication optimizations and parallelization for distributed-memory systems","authors":"E. Mehofer, Bernhard Scholz","doi":"10.1109/EMPDP.2001.905042","DOIUrl":"https://doi.org/10.1109/EMPDP.2001.905042","url":null,"abstract":"In high-performance systems execution time is of crucial importance justifying advanced optimization techniques. Traditionally, optimization is based on static program analysis. The quality of program optimizations, however, can be substantially improved by utilizing runtime information. Probabilistic data-flow frameworks compute the probability with what data-flow facts may hold at some program point based on representative profile runs. Advanced optimizations can use this information in order to produce highly efficient code. In this paper we introduce a novel optimization technique in the context of High Performance Fortran (HPF) that is based on probabilistic data-flow information. We consider statically undefined attributes which play an important role for parallelization and compute for those attributes the probabilities to hold some specific value during runtime. For the most probable attribute values highly-optimized, specialized code is generated. In this way significantly better performance results can be achieved. The implementation of our optimization is done in the context of VFC, a source-to-source parallelizing compiler for HPF/F90.","PeriodicalId":262971,"journal":{"name":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","volume":"447 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134311973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Prediction of communication performance for wide area computing systems 广域计算系统通信性能预测
Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905078
Jörg Schulz, C. Hochberger, D. Tavangarian
This paper presents a new algorithm to predict the communication performance for high performance parallel computing in wide area networks. Such a powerful prediction allows to balance the requirements of communication time and computation time. Such a system can be used in almost any wide area computing application. The major feature of this algorithm is the consideration of the human rhythm of life. Prediction values are calculated using a selected number of arguments, depending on daytime and weekday. Thus it is possible to give very accurate prediction values several weeks ahead. In this paper we discuss the usage of this method in the hypercomputer at the University of Restock. The prediction algorithm and monitoring tools are written in Java and can thus be used on almost any platform.
提出了一种用于广域网中高性能并行计算通信性能预测的新算法。这种强大的预测可以平衡通信时间和计算时间的需求。这样的系统可以用于几乎任何广域计算应用。该算法的主要特点是考虑了人类的生活节奏。预测值是根据白天和工作日的不同,使用选定的参数数来计算的。因此,可以提前几周给出非常准确的预测值。在本文中,我们讨论了这种方法在Restock大学的超级计算机中的应用。预测算法和监控工具是用Java编写的,因此几乎可以在任何平台上使用。
{"title":"Prediction of communication performance for wide area computing systems","authors":"Jörg Schulz, C. Hochberger, D. Tavangarian","doi":"10.1109/EMPDP.2001.905078","DOIUrl":"https://doi.org/10.1109/EMPDP.2001.905078","url":null,"abstract":"This paper presents a new algorithm to predict the communication performance for high performance parallel computing in wide area networks. Such a powerful prediction allows to balance the requirements of communication time and computation time. Such a system can be used in almost any wide area computing application. The major feature of this algorithm is the consideration of the human rhythm of life. Prediction values are calculated using a selected number of arguments, depending on daytime and weekday. Thus it is possible to give very accurate prediction values several weeks ahead. In this paper we discuss the usage of this method in the hypercomputer at the University of Restock. The prediction algorithm and monitoring tools are written in Java and can thus be used on almost any platform.","PeriodicalId":262971,"journal":{"name":"Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130110069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1