Pub Date: 2001-02-07 | DOI: 10.1109/EMPDP.2001.905041
Venue: Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing
Title: Elimination of redundant messages with a two-pass static analysis algorithm
Authors: A. Girault
Abstract: Eliminating redundant messages in distributed programs reduces communication overhead and thus improves the overall performance of the distributed program, so a lot of recent work has pursued this goal. We present an algorithm for eliminating redundant valued messages in parallel programs that have been distributed automatically. The algorithm works on programs whose control flow is as general as possible, i.e., contains gotos: precisely, the control flow is a finite deterministic automaton with a DAG of actions in each state. Our algorithm proceeds in two passes: first, a global data-flow analysis computes, for each state of the automaton, the set of distant variables that are known at the beginning of the state; then, a local elimination removes redundant messages locally in each state of the automaton. We present the algorithms along with an example.
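The global pass described in the abstract is a classic forward data-flow fixed point: what is known at a state's entry is the intersection, over its predecessors, of what is known at their exit. The sketch below illustrates that shape under assumed data structures (a successor map and a per-state set of variables received inside the state); it is not the paper's actual algorithm.

```python
def known_at_entry(states, initial, transfer):
    """Forward data-flow fixed point over an automaton (illustrative).

    states:   dict mapping state -> list of successor states
    initial:  the automaton's initial state
    transfer: dict mapping state -> set of distant variables whose
              values are received (hence become known) inside the state

    Returns, for each state, the set of distant variables guaranteed
    to be known on entry: the intersection, over all predecessors,
    of what is known when leaving them.
    """
    all_vars = set().union(*transfer.values()) if transfer else set()
    # Optimistic initialisation: everything known, except at the start.
    known = {s: set(all_vars) for s in states}
    known[initial] = set()
    changed = True
    while changed:
        changed = False
        for s, succs in states.items():
            out = known[s] | transfer[s]      # known when leaving s
            for t in succs:
                new = known[t] & out          # meet = intersection
                if new != known[t]:
                    known[t] = new
                    changed = True
    return known

# Diamond automaton: both branches receive y, so y is known at D's entry
states = {'A': ['B', 'C'], 'B': ['D'], 'C': ['D'], 'D': []}
transfer = {'A': {'x'}, 'B': {'y'}, 'C': {'y'}, 'D': set()}
known = known_at_entry(states, 'A', transfer)
# a message re-sending y into state D would be redundant
```

Because sets only shrink from the optimistic start, the iteration terminates at the greatest fixed point; a message carrying a variable already in `known` at the receiving point is the kind the local pass can eliminate.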
Pub Date: 2001-02-07 | DOI: 10.1109/EMPDP.2001.905069
Title: Off-line real-time fault-tolerant scheduling
Authors: C. Dima, A. Girault, C. Lavarenne, Y. Sorel
Abstract: We address the problem of off-line fault-tolerant scheduling of an algorithm onto a multiprocessor architecture with distributed memory, and provide a generic algorithm which solves this problem. We take into account two kinds of failures: fail-silent and omission. The basic technique we use is the replication of operations and data communications. We then discuss the principles which govern the execution of schedules with replication under the state-machine and primary/backup arbitrations between replicas. We also show how to compute the execution date for each operation and the timeouts used for detecting failures. We end with a heuristic which, using this calculus, computes a possibly non-optimal schedule by finding plain schedules for each failure pattern and then combining them into a single schedule with replication.
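To make the replication idea concrete: tolerating k fail-silent processor failures requires each operation to run as k+1 replicas on k+1 distinct processors. The toy scheduler below places replicas greedily on the least-loaded processors; it is only a sketch of the general principle, not the paper's heuristic (which works per failure pattern and then merges the resulting schedules).

```python
def schedule_with_replication(tasks, durations, n_procs, k):
    """Greedy active-replication scheduler (illustrative only).

    To tolerate up to k fail-silent processor failures, each task is
    replicated k+1 times on k+1 distinct processors.  Replicas go to
    the currently least-loaded processors; the returned schedule maps
    task -> list of (processor, start, end) triples.
    """
    assert k + 1 <= n_procs, "need at least k+1 processors"
    ready = [0.0] * n_procs          # next free date per processor
    schedule = {}
    for t in tasks:
        # pick the k+1 least-loaded processors, all distinct
        procs = sorted(range(n_procs), key=lambda p: ready[p])[:k + 1]
        placements = []
        for p in procs:
            start = ready[p]
            end = start + durations[t]
            ready[p] = end
            placements.append((p, start, end))
        schedule[t] = placements
    return schedule

# two tasks, three processors, tolerate one failure (k = 1)
sched = schedule_with_replication(['a', 'b'], {'a': 2, 'b': 3}, 3, 1)
```

The computed end dates play the role of the abstract's execution dates; in a real fault-tolerant schedule a timeout slightly beyond a replica's expected end date is what signals a fail-silent processor.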
Pub Date: 2001-02-07 | DOI: 10.1109/EMPDP.2001.905052
Title: Efficient BSR-based parallel algorithms for geometrical problems
Authors: D. Semé, J. Myoupo
Abstract: This paper presents BSR parallel algorithms for three geometrical problems: point location, convex hull, and smallest enclosing rectangle. These problems are solved in constant time using the BSR (broadcasting with selective reduction) model introduced by Akl and Guenther in 1989. The first algorithm uses O(N) processors (N is the number of edges of the polygon R). The second uses O(N'^2) processors (N' is the number of points), and the third also uses O(N'^2) processors (it needs the convex hull as input) to solve the smallest enclosing rectangle problem. These new results suggest that many other geometrical problems can be solved in constant time using the BSR model.
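A BSR step broadcasts data to all processors, lets each processor select what is relevant to it, and reduces the selected values in one unit of time, which is how constant-time bounds arise. The sketch below simulates that shape sequentially for point location in a convex polygon: one "processor" per edge, an AND as the reduction. It is an illustration of the model's flavor, not the paper's algorithm.

```python
def inside_convex_polygon(point, vertices):
    """Simulated BSR step for point location in a convex polygon.

    In BSR the query point would be broadcast to one processor per
    edge; each processor tests the point against its edge, and a
    single AND-reduction yields the answer, all in constant time.
    Here the N edge tests simply run in a loop.
    `vertices` must be in counter-clockwise order.
    """
    px, py = point
    n = len(vertices)
    results = []
    for i in range(n):                    # one "processor" per edge
        (x1, y1), (x2, y2) = vertices[i], vertices[(i + 1) % n]
        # cross product: is the point on the left of edge i?
        cross = (x2 - x1) * (py - y1) - (y2 - y1) * (px - x1)
        results.append(cross >= 0)
    return all(results)                   # the AND-reduction

square = [(0, 0), (1, 0), (1, 1), (0, 1)]   # unit square, CCW
```

With N processors each test runs concurrently, so the whole query costs one BSR step, matching the O(N)-processor, constant-time bound quoted for the first algorithm.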
Pub Date: 2001-02-07 | DOI: 10.1109/EMPDP.2001.905043
Title: Integrating HPF in a skeleton based parallel language
Authors: C. Gennaro, R. Perego, S. Orlando
Abstract: Although HPF allows programmers to express data-parallel computations in a portable, high-level way, it is widely accepted that many important parallel applications cannot be implemented efficiently following a pure data-parallel paradigm. For these applications, rather than writing a single data-parallel program, it is more profitable to subdivide the whole computation into several data-parallel pieces that run concurrently and cooperate, thus exploiting task parallelism. This paper discusses the integration of HPF with SkIE, a skeleton-based coordination language implemented on top of MPI (Message Passing Interface), which makes it possible to describe complex parallel computational structures. We show how HPF can be used inside common forms of parallelism, e.g. pipelines and processor farms, and we present experimental results for a sample application.
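The two skeletons the abstract names, pipeline and processor farm, have a very small coordination core. The sketch below shows that shape in plain Python (threads standing in for MPI processes, ordinary functions standing in for HPF data-parallel modules); it only illustrates the coordination pattern, not SkIE's actual syntax or runtime.

```python
from concurrent.futures import ThreadPoolExecutor

def farm(worker, tasks, n_workers=4):
    """'Processor farm' skeleton: a pool of identical workers consumes
    a stream of independent tasks.  In SkIE each worker could itself
    be a data-parallel (e.g. HPF) module; here it is a plain function.
    map() preserves the input order of the results."""
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        return list(pool.map(worker, tasks))

def pipeline(stages, tasks):
    """Pipeline skeleton: each task flows through the stages in order.
    A real implementation overlaps the stages; this sequential
    stand-in only shows the composition."""
    for stage in stages:
        tasks = [stage(t) for t in tasks]
    return list(tasks)
```

The point of the skeleton approach is exactly this separation: the coordination structure above is fixed and reusable, while the `worker` and `stage` bodies carry the application's data-parallel work.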
Pub Date: 2001-02-07 | DOI: 10.1109/EMPDP.2001.905006
Title: Local-area and wide-area computing: architectures and tools
Authors: D. Tavangarian
Abstract: In this paper, recent architectural approaches and tools for local-area and wide-area computing using clusters of servers, workstations, and PCs as multicomputers (i.e., parallel computing on workstation clusters) are classified and described. The goal of such systems is to concentrate the available computing resources on solving computing problems. A special focus of this contribution is recent research in cost-efficient parallel computing with standard-component multicomputer systems, concentrating on locally organized clusters for local-area computing and on wide-area multiclusters (hyperclusters, or clusters of clusters) for wide-area computing. Selected examples demonstrate the improvements obtained through high-speed interconnection networks and optimized protocol system architectures in local-area systems, and through optimized organizations in wide-area systems.
Pub Date: 2001-02-07 | DOI: 10.1109/EMPDP.2001.905047
Title: SIMLAB-a simulation environment for storage area networks
Authors: P. Berenbrink, A. Brinkmann, C. Scheideler
Abstract: In this paper we present SIMLAB, a simulation environment for storage area networks. SIMLAB is part of the PRESTO project, a joint project of the Electrical Engineering Department and the Computer Science Department of Paderborn University. The aim of the PRESTO project is to construct a scalable and resource-efficient storage network that can support the real-time delivery of data. SIMLAB was implemented to aid the development and verification of distributed algorithms for this storage network, but it has been designed so that it can also be used to simulate many other types of networking problems. SIMLAB is based on C++ and common libraries and input/output formats, which ensures that it can be used on many different platforms. We therefore expect SIMLAB to also be useful to others working on similar problems.
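At the core of a network simulator like the one described sits a discrete-event engine: a time-ordered queue of pending events that the simulator pops and executes, advancing a virtual clock. The Python sketch below shows that generic core (SIMLAB itself is C++, and its actual API is not shown here).

```python
import heapq

class EventSimulator:
    """Minimal discrete-event simulation core, the kind of engine a
    network simulator builds on (illustrative sketch, not SIMLAB's
    actual interface)."""

    def __init__(self):
        self.now = 0.0
        self._queue = []    # min-heap of (time, seq, action)
        self._seq = 0       # tie-breaker for events at the same time

    def schedule(self, delay, action):
        """Schedule a zero-argument callable `delay` time units ahead."""
        heapq.heappush(self._queue, (self.now + delay, self._seq, action))
        self._seq += 1

    def run(self):
        """Pop events in timestamp order until the queue is empty."""
        while self._queue:
            self.now, _, action = heapq.heappop(self._queue)
            action()

sim = EventSimulator()
log = []
sim.schedule(2.0, lambda: log.append(('pkt-b arrives', sim.now)))
sim.schedule(1.0, lambda: log.append(('pkt-a arrives', sim.now)))
sim.run()
```

Events execute in timestamp order regardless of scheduling order, which is what lets such an engine model packet deliveries, disk completions, and timeouts uniformly.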
Pub Date: 2001-02-07 | DOI: 10.1109/EMPDP.2001.904960
Title: On the impact of message packetization in networks of workstations with irregular topology
Authors: Xavier Molero, F. Silla, V. Santonja, J. Duato
Abstract: Networks of workstations (NOWs) are becoming an increasingly popular alternative to parallel computers, both for applications with high resource demands (such as memory capacity and input/output storage space) and for small-scale parallel computing. The software messaging layers in these systems usually become a bottleneck because of the overhead they introduce. Some proposals, like FM and BIP, considerably reduce this overhead by splitting long messages into several packets, and have been shown to improve communication performance. However, the effect of message packetization on the interconnection network has not yet been analyzed. In this paper we examine message packetization from the point of view of the interconnection network in the context of bimodal traffic, considering two routing algorithms: up*/down* and minimal adaptive routing. Our study shows that with up*/down* routing, message packetization dramatically increases latency and reduces throughput for both long and short messages. With minimal adaptive routing, short messages can benefit from packetization, but at the cost of increased latency for long messages. In either case, network throughput is considerably reduced.
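The bandwidth cost of packetization is easy to see with a back-of-the-envelope model: every extra packet repeats the header, so the same payload occupies more network flits. The sketch below computes latency under a deliberately simplified cut-through model (my own toy formula, not the paper's simulation setup).

```python
import math

def packetized_latency(msg_flits, pkt_payload, header_flits,
                       hops, t_flit=1.0, t_route=1.0):
    """Toy latency of a packetized message under cut-through routing.

    Each packet carries `pkt_payload` data flits plus `header_flits`
    of overhead.  Packets pipeline one behind another, so the last
    flit arrives after the routing delay along the path plus the
    serialized transmission of every flit of every packet.
    """
    n_pkts = math.ceil(msg_flits / pkt_payload)
    total_flits = n_pkts * (pkt_payload + header_flits)
    return hops * t_route + total_flits * t_flit
```

Even this crude model shows the trend the study measures: a 1024-flit message sent whole carries one header, while the same message in 64-flit packets carries sixteen, inflating the occupied bandwidth and hence the latency seen by long messages. What the model cannot show, and the paper does, is the interaction with the routing algorithm and with blocked packets in an irregular topology.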
Pub Date: 2001-02-07 | DOI: 10.1109/EMPDP.2001.905076
Title: Predictability of cellular programs implemented with CAMELot
Authors: G. Folino, G. Spezzano
Abstract: In this paper we present a performance model to analyse the scalability and predict the performance of cellular programs developed with the CAMELot system. CAMELot is a problem-solving environment that uses the cellular automata model to model and simulate dynamic complex phenomena. The environment supports CARPET, a purpose-built language for programming and steering cellular applications. The proposed performance model is based on the isoefficiency method. Isoefficiency is a scalability measure that determines whether a parallel system can preserve its efficiency by increasing the problem size as the number of processors grows. With isoefficiency analysis we can test a program's performance on a few processors and then predict its performance on a larger number of processors; it also lets us study system behavior when other hardware parameters, such as processor and communication speeds, change. Scalability prediction examples are given for two-dimensional and three-dimensional cellular programs on a Meiko CS-2 parallel machine.
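The isoefficiency method rests on the relation E = 1 / (1 + To(W, p) / W), where W is the serial work and To the total parallel overhead: efficiency stays constant exactly when W grows fast enough to keep To/W constant. The sketch below computes, for an assumed overhead function (a made-up example, not CAMELot's measured model), how large the problem must be to hold a target efficiency.

```python
import math

def efficiency(W, p, overhead):
    """Parallel efficiency E = 1 / (1 + To(W, p) / W), where W is the
    problem size (serial work) and To(W, p) the total overhead."""
    return 1.0 / (1.0 + overhead(W, p) / W)

def isoefficiency_size(p, overhead, target_e, hi=1e12):
    """Smallest problem size W keeping efficiency >= target_e on p
    processors, found by bisection (assumes efficiency is
    non-decreasing in W, true when To grows sublinearly in W)."""
    lo = 1e-9
    for _ in range(200):
        mid = (lo + hi) / 2
        if efficiency(mid, p, overhead) >= target_e:
            hi = mid
        else:
            lo = mid
    return hi

# Hypothetical overhead: 10 time units per processor, times log2(p)
# (a broadcast-style cost, independent of W).
to = lambda W, p: 10 * p * math.log2(p)
w_needed_8 = isoefficiency_size(8, to, 0.8)   # analytically 4*To = 960
```

For this overhead the isoefficiency function is W = Θ(p log p): doubling the processors forces a slightly more than doubled problem size to hold 80% efficiency, which is the kind of prediction the paper validates on the Meiko CS-2.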
Pub Date: 2001-02-07 | DOI: 10.1109/EMPDP.2001.905018
Title: Running multithreaded applications in exokernel-based systems: porting CThreads to Xok
Authors: E. Artiaga, Marisa Gil
Abstract: Exokernel-based systems provide efficient access to the system's actual hardware resources. Parallel applications can take advantage of this kind of access and adapt to the resources actually available in order to increase performance. In this paper we present an extension that allows multithreaded applications to run on an Intel-based exokernel system; for this purpose, we have ported a user-level threads package to that environment. Our final goal is a multiprocessor exokernel version capable of running parallel applications on top of it. We use the exokernel interface to access the physical execution resources, and we have designed the lower layer of the multithreading library to use them.
Pub Date: 2001-02-07 | DOI: 10.1109/EMPDP.2001.905059
Title: Integrating pervasive information acquisition to enhance workspace awareness
Authors: A. Ferscha
Abstract: Workspace awareness, the "... up-to-the-moment understanding of another person's interaction with a shared workspace", involves knowing who is working in the workspace, where individuals are working, what they are doing or going to do, how and when they are executing their work, and what their motivation for doing it is (why). Traditional awareness systems use dynamic user-behavior data collected by monitoring events from I/O devices (keyboard, mouse, touchscreen) at the interface. To preserve and maintain a more intuitive fidelity of awareness, we extend our workspace awareness system TEAMSPACE to collect and exploit awareness information from the user's physical activities in the workspace, such as hand gestures and body movement, using position and orientation tracking technologies. User activities beyond interaction with desktop computing facilities are thus seamlessly integrated into a shared virtual workspace, opening a whole new dimension of awareness abilities.