Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing: latest papers

Implementing standard and nonstandard parallel cellular automata in CARPET

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905049

D. Talia

Cellular automata are a nature-inspired parallel processing model. The model was proposed by J. von Neumann to simulate complex dynamical processes. In the past two decades, several models of cellular automata that differ from the one originally proposed by von Neumann have been defined for modeling real-world systems and phenomena. This paper describes the design and implementation of standard and nonstandard parallel cellular automata in the CARPET language. CARPET is a cellular-automata-based language that has been implemented on MIMD parallel computers. The language is specifically designed for programming cellular computations and supports concise and efficient coding of parallel cellular algorithms. The paper analyzes the main features of the language and describes how they can be exploited to implement different cellular automata on parallel computers, from the standard model to its modifications and generalizations. Inhomogeneous, partitioned, asynchronous, and probabilistic cellular automata programmed in CARPET are presented.
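
To make the distinction between a standard and a probabilistic cellular automaton concrete, here is a minimal sketch in Python. It is not CARPET code and does not reflect the paper's implementation; the function names `step` and `probabilistic_step`, the one-dimensional ring topology, and the choice of elementary rule 90 are all assumptions made for illustration.

```python
import random


def step(cells, rule=90):
    """One synchronous update of a two-state, radius-1 automaton on a ring.

    `rule` is the usual 8-bit elementary rule number (an illustrative choice).
    """
    n = len(cells)
    out = []
    for i in range(n):
        # cells[i - 1] wraps around to the last cell when i == 0.
        left, centre, right = cells[i - 1], cells[i], cells[(i + 1) % n]
        index = (left << 2) | (centre << 1) | right
        out.append((rule >> index) & 1)
    return out


def probabilistic_step(cells, rule=90, p=0.5, rng=None):
    """Probabilistic variant: each cell applies the rule with probability p,
    otherwise it keeps its current state (hypothetical update scheme)."""
    rng = rng or random.Random()
    updated = step(cells, rule)
    return [new if rng.random() < p else old for old, new in zip(cells, updated)]


if __name__ == "__main__":
    state = [0, 0, 0, 1, 0, 0, 0]
    print(step(state))
    print(probabilistic_step(state, p=0.5, rng=random.Random(0)))
```

With `p=1.0` the probabilistic variant reduces to the standard synchronous update, which is one simple way to see the standard model as a special case.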

Citations: 2

Java objects communication on a high performance network

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905044

L. Courtrai, Y. Mahéo, Frédéric Raimbault

The availability of local high-performance networks already makes workstation clusters a serious alternative for parallel computing. However, a high-level and effective programming language for such architectures is still missing. Recent work shows interest in Java for cluster programming. One of the main issues is handling the communication of objects efficiently enough to really take advantage of the network speed. The paper presents an alternative to the standard serialization process in the form of a Java object communication library. Object allocation is controlled in such a way that the transfer of objects between two nodes reduces to a direct memory-to-memory dump. We show how specific allocation mechanisms can cooperate with a Java Virtual Machine so that fast transfers of graphs of objects can be achieved. Experimental results are given for basic operations and for a genetic programming application; they demonstrate a dramatic change in the transfer speed.

Citations: 2

Predicting the time of oblivious programs

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905063

J. González, C. León, J. R. García, C. Rodríguez, F. D. Sande, F. Piccoli, A. M. Printista

The BSP model can be extended with a zero-cost synchronization mechanism that can be used when the number of messages each processor is due to receive is known. This mechanism, usually known as "oblivious synchronization", implies that different processors can be in different supersteps at the same time. An unwanted consequence of these software improvements is a loss of accuracy in prediction. This paper proposes an extension of the BSP complexity model to deal with oblivious barriers and shows its accuracy.
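
For context, the classical BSP cost model that the abstract refers to charges each superstep the maximum local work, plus g times the largest number of messages sent or received by any processor, plus the barrier latency L. The sketch below computes that classical prediction only; the abstract does not give the formula of the proposed extension for oblivious barriers, so it is not reproduced here. The function name `bsp_cost` and the input layout are assumptions for illustration.

```python
def bsp_cost(supersteps, g, L):
    """Predicted running time under the classical BSP model.

    `supersteps` is a list of (work, h) pairs, where `work[i]` is the local
    computation time of processor i and `h[i]` is the number of messages it
    sends or receives in that superstep (hypothetical input layout).
    """
    total = 0
    for work, h in supersteps:
        # Every processor waits at the barrier for the slowest one.
        total += max(work) + g * max(h) + L
    return total


if __name__ == "__main__":
    program = [([10, 12], [3, 5]), ([4, 4], [0, 0])]
    print(bsp_cost(program, g=2, L=7))
```

Because the classical formula assumes all processors leave each barrier together, it has no way to express processors that are in different supersteps at the same time, which is the situation the abstract says degrades prediction accuracy.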

Citations: 0

A freeze/unfreeze mechanism for the LinuxThreads library

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905020

P. Lombard, Y. Denneulin

Clusters of standard components are becoming a viable alternative to traditional supercomputers. The typical architecture of these clusters is standard PCs connected by a high-performance network. Another rising interest is in the use of idle computers for computation. The operating system used on this kind of platform is generally Linux because it is stable and flexible: it can be studied, modified and tuned. When using a parallel architecture, two important points are fault tolerance and load balancing in the scheduling of activities. This is especially true for clusters that are shared between users and applications and that rely on hardware not as robust as dedicated parallel machines. To provide these two services, it is necessary to have a mechanism to stop (freeze) activities in a preemptive manner and, of course, one to restore them to the state they were in when frozen. In this paper we present our proposal to modify the LinuxThreads library to provide this service. We analyze how this library works and give some performance results for the modified library.

Citations: 0

Supporting remote reference updating through garbage collection in a mobile object system

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905014

M. Avvenuti, Alessio Vecchio

Programming paradigms based on object mobility, such as mobile agents, can contribute greatly to the design of distributed applications, but they also introduce issues that are not present when objects are statically bound to their execution environments. Auxiliary mechanisms are necessary to allow an application to control mobile objects and to give mobile agents the capability of interacting with each other despite mobility. The work described in this paper deals with how to build a mobile object system based on the Java distributed object model. In particular, we describe how to take advantage of Java RMI's distributed garbage collector to implement an effective remote reference updating scheme, which is necessary to support object interaction even in the presence of mobility.

Citations: 1

A smart memory architecture for the efficient support of artificial neural nets

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905074

K. Großpietsch, J. Büddefeld

A "smart memory" approach is presented, i.e. the new architecture is achieved by extending the functionality of a conventional RAM structure. The architecture additionally contains two innovative features: a small q-bit-wide ALU is associated with every word cell of w bits; and, by extending the memory decoder, multiple access to certain sets of word cells within the memory, as well as activation of their ALUs, is possible. It is shown that, based on these features, the standard numerical problem of adding up the m components of a vector of dimension m can be carried out in the new architecture with a time complexity of O(√m). The execution of artificial neural nets, especially the on-line recognition of patterns, mainly depends on the time-efficient execution of weighted sums. It is shown that in our architecture these weighted sums can be computed quite efficiently. The computation time is highly superior to the time complexity on sequential von Neumann machines. In addition, we show that, if requested, the training mode of a neural net can also be sped up significantly. This is achieved by means of a simple crossbar switch which can be modularly added to the array of memory chips.
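
The abstract does not say how the O(√m) bound for vector summation is obtained. One generic scheme that reaches this bound, shown here purely as an assumption and not as the authors' method, splits the m components into about √m blocks of about √m elements, sums all blocks concurrently, and then adds the partial sums. The Python sketch below simulates that scheme sequentially and counts the parallel addition steps; the function name `blocked_sum` is hypothetical.

```python
import math


def blocked_sum(values):
    """Return (total, parallel_steps) for a two-phase blocked summation.

    Phase 1: every block of ceil(sqrt(m)) elements is summed sequentially,
             with all blocks assumed to proceed at the same time.
    Phase 2: the partial sums are added one after another.
    """
    m = len(values)
    if m == 0:
        return 0, 0
    block = math.isqrt(m - 1) + 1  # ceil(sqrt(m)) for m >= 1
    partial = [sum(values[i:i + block]) for i in range(0, m, block)]
    # Additions inside one block, plus additions to combine the partial sums.
    steps = (block - 1) + (len(partial) - 1)
    return sum(partial), steps


if __name__ == "__main__":
    print(blocked_sum(list(range(1, 17))))
```

Both phases take roughly √m additions, so the step count grows as O(√m), compared with m - 1 additions on a purely sequential machine.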

Citations: 1

Visual data-parallel programming for signal processing applications

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 2001-02-07 DOI: 10.1109/EMPDP.2001.905031

Pierre Boulet, J. Dekeyser, Jean-Luc Levaire, P. Marquet, J. Soula, A. Demeure

Matrix manipulation programs are easily developed using a visual language. For signal processing, a graph of tasks operates on arrays. Each task iterates the same code on different patterns tiling these arrays. In this case, visual specifications of dependencies between the pattern elements are enough to define an application. Based on the ARRAY-OL language developed by Thomson Marconi Sonar, we propose a graphical environment, GASPARD, dedicated to the data-parallel paradigm. Only elementary SPMD tasks are textual. A full environment has been implemented; it includes a graphical editor, a code transformer and a code generator for SMP computers.

Citations: 21

Remote debugging of CORBA objects

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 2001-02-01 DOI: 10.1109/EMPDP.2001.905067

Magdalena Sujecka, B. Wiszniewski

The paper describes an idea and initial experience gained in applying fault-tolerance mechanisms, namely object replication, for on-line debugging of remote objects of distributed software applications. It examines available object-oriented platforms supporting fault-tolerance and mechanisms enabling implementation of remote object debugging. It also reviews this concept from the perspective of the coming CORBA 3 standard.

Citations: 1

Towards a cost model for distributed and replicated data stores

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 2001-02-01 DOI: 10.1109/EMPDP.2001.905075

H. Stockinger, Kurt Stockinger, E. Schikuta, I. Willers

Large, Petabyte-scale data stores need detailed design considerations about distributing and replicating particular parts of the data store in a cost-effective way. Technical issues need to be analysed and, based on these constraints, an optimisation problem can be formulated. In this paper we provide a novel cost model for building a world-wide distributed Petabyte data store which will be in place starting from 2005 at CERN and its collaborating, world-wide distributed institutes. We elaborate on a framework for assessing potential system costs and influences which are essential for the design of the data store.

Citations: 50

Static and dynamic scheduling algorithms for scalable Web server farm

Proceedings Ninth Euromicro Workshop on Parallel and Distributed Processing

Pub Date : 2001-02-01 DOI: 10.1109/EMPDP.2001.905064

E. Casalicchio, Salvatore Tucci

Multiprocessor-based servers are often used for building popular Web sites which have to guarantee an acceptable Quality of Web Service. In common multi-node systems, namely Web server farms, a Web switch (say, Dispatcher) routes client requests among the server nodes. This architecture resembles a traditional cluster in which a global scheduler dispatches parallel applications among the server nodes. The main difference is that the load reaching Web server farms tends to occur in waves with intervals of heavy peaks. These heavy-tailed characteristics have motivated the use of policies based on dynamic state information for global scheduling in Web server farms. This paper presents an accurate comparison between static and dynamic policies for different classes of Web sites. The goal is to identify the main features of architectures and load management algorithms that guarantee scalable Web services. We verify that a Web farm with a Dispatcher that has full control over client connections is a very robust architecture. Indeed, we demonstrate that if the Web site provides only HTML pages or simple database searches, the Dispatcher does not need to use sophisticated scheduling algorithms even if the load occurs in heavy bursts. Dynamic scheduling policies appear to be necessary for scalability only when most requests are for Web services that are three or more orders of magnitude more costly than providing HTML pages with some embedded objects.
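
The difference between a static and a dynamic dispatching policy can be illustrated with a small Python sketch. It is a toy model under stated assumptions, not the simulation used in the paper: round-robin stands in for a static policy, least-loaded assignment stands in for a dynamic one, request costs are known in advance, and load is simply the accumulated cost assigned to each server. The function names are hypothetical.

```python
from itertools import cycle


def round_robin(costs, n_servers):
    """Static policy: assign requests to servers in fixed rotation,
    ignoring how much work each server already holds."""
    load = [0] * n_servers
    for server, cost in zip(cycle(range(n_servers)), costs):
        load[server] += cost
    return load


def least_loaded(costs, n_servers):
    """Dynamic policy: assign each request to the server with the
    smallest accumulated load (ties go to the lowest index)."""
    load = [0] * n_servers
    for cost in costs:
        server = min(range(n_servers), key=load.__getitem__)
        load[server] += cost
    return load


if __name__ == "__main__":
    # Alternating cheap and expensive requests, three orders of magnitude apart.
    requests = [1, 1000, 1, 1000, 1, 1000]
    print(round_robin(requests, 2))
    print(least_loaded(requests, 2))
```

When all requests cost about the same, the two policies produce nearly identical loads in this model, which is consistent with the abstract's claim that sophisticated scheduling matters mainly when request costs differ by orders of magnitude.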

Citations: 39
