Concurr. Pract. Exp.: Latest Articles


Methodology and tools for system analysis of parallel pipelines

Concurr. Pract. Exp.

Pub Date : 1999-09-01 DOI: 10.1002/(SICI)1096-9128(199909)11:11%3C655::AID-CPE449%3E3.0.CO;2-7

M. Fleury, N. Sarvan, A. Downton, A. Clark

The pipelines of processor farms (PPF) design pattern, intended for continuous-flow embedded systems, has been augmented by a software toolkit at the system analysis level. Other relevant approaches to system support employing tools are reviewed. The PPF structure supports incrementally scalable systems which can meet real-time specifications. An outline of the design and development cycle of PPF systems follows. The paper considers in detail the prediction component of the cycle. A graphical simulation tool for modelling asynchronous pipeline behaviour uses a Java-based visual display. An extended example showing how the performance tool supports PPF design principles concludes the paper.
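As a rough illustration of why a pipeline of processor farms scales incrementally, the sketch below sizes each farm so that the slowest stage still meets a target rate. It is an idealised model that ignores communication and buffering, and it is not the paper's toolkit; the function names, the stage costs, and the target rate are all assumptions made for this example.

```python
def pipeline_throughput(stages):
    """stages: list of (service_ms_per_item, workers_in_farm).
    Returns the items/second sustained by the slowest stage."""
    return min(workers * 1000 / service_ms for service_ms, workers in stages)


def workers_needed(service_ms_per_stage, target_items_per_s):
    """Smallest farm size per stage that meets the target rate (integer ceiling)."""
    return [(ms * target_items_per_s + 999) // 1000 for ms in service_ms_per_stage]


service_ms = [20, 100, 50]                   # assumed per-item cost of three stages
farms = workers_needed(service_ms, 50)       # size each farm for 50 items/second
rate = pipeline_throughput(list(zip(service_ms, farms)))
```

Under these assumptions the bottleneck stage determines the rate, so adding workers only to that stage's farm is enough to raise throughput.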


Citations: 3

Efficient implementation of a portable parallel programming model for image processing

Concurr. Pract. Exp.

Pub Date : 1999-09-01 DOI: 10.1002/(SICI)1096-9128(199909)11:11%3C671::AID-CPE450%3E3.0.CO;2-6

P. Morrow, D. Crookes, T. Brown, G. McAleese, D. Roantree, I. Spence

This paper describes a domain specific programming model for execution on parallel and distributed architectures. The model has initially been targeted at the application area of image processing, though the techniques developed may be more generally applicable to other domains where an algebraic or library-based approach is common. Efficiency is achieved by the concept of a self-optimising class library of primitive image processing operations, which allows programs to be written in a high level, algebraic notation and which is automatically parallelised (using an application-specific data parallel approach). The class library is extended automatically with optimised operations, generated by a transformation system, giving improved execution performance. The parallel implementation of the model described here is based on MPI and has been tested on a C40 processor network, a quad-processor Unix workstation, and a network of PCs running Linux. Timings are included to indicate the impact of the automatic optimisation facility (rather than the effect of parallelisation). Copyright © 1999 John Wiley & Sons, Ltd.



Citations: 24

Converting best manual practice into generic automatable strategies for unstructured mesh parallelisation

Concurr. Pract. Exp.

Pub Date : 1999-09-01 DOI: 10.1002/(SICI)1096-9128(199909)11:11%3C593::AID-CPE446%3E3.0.CO;2-L

K. McManus, S. Johnson, M. Cross

The manual effort required to convert sequential computational mechanics programs into a useful, scalable parallel form is considerable. Tools that can assist in the conversion process are clearly required. Computer aided parallelisation tools (CAPTools) have been developed to generate efficient parallel code for real world structured grid application codes such as Computational Fluid Dynamics. Automatable single-program multi-data (SPMD) overlapping domain decomposition (DD) techniques established for structured grid codes have been adapted by the authors to manually parallelise unstructured mesh applications. Inspector loops have been used to provide generic techniques for the run-time support necessary to extend the capabilities of CAPTools to automatic implementation of SPMD DD techniques in the parallelisation of unstructured mesh codes. Copyright © 1999 John Wiley & Sons, Ltd.
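The inspector-loop idea mentioned in the abstract can be illustrated with a minimal sketch. Everything below is a hypothetical illustration and not CAPTools output: the names `inspector` and `executor`, the partition of nodes, and the toy edge list are assumptions. The inspector scans the indirection array once at run time to find which off-processor nodes a partition reads (its overlap, or halo); the executor then runs the edge loop using only owned and halo values.

```python
def inspector(edges, owned):
    """Scan the indirection array once and return the halo:
    nodes read by this partition's edges but owned elsewhere."""
    halo = set()
    for a, b in edges:
        for node in (a, b):
            if node not in owned:
                halo.add(node)
    return sorted(halo)


def executor(edges, owned, local_values):
    """For each owned node, accumulate the sum of its neighbours' values.
    local_values needs to hold owned and halo entries only."""
    result = {node: 0.0 for node in owned}
    for a, b in edges:
        if a in owned:
            result[a] += local_values[b]
        if b in owned:
            result[b] += local_values[a]
    return result


# Toy unstructured mesh: 6 nodes, edges given as an indirection list.
all_edges = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5), (0, 5), (1, 4)]
values = {n: float(n) for n in range(6)}

# Assumed partition: processor 0 owns nodes 0-2 and keeps every edge touching them.
owned0 = {0, 1, 2}
edges0 = [e for e in all_edges if e[0] in owned0 or e[1] in owned0]

halo0 = inspector(edges0, owned0)                      # nodes to fetch from other processors
local0 = {n: values[n] for n in owned0 | set(halo0)}   # stands in for the halo exchange
partial0 = executor(edges0, owned0, local0)
```

In a real SPMD code the dictionary copy that builds `local0` would be replaced by message passing between neighbouring partitions.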


Citations: 3

Accurate performance prediction using visual prototypes

Concurr. Pract. Exp.

Pub Date : 1999-09-01 DOI: 10.1002/(SICI)1096-9128(199909)11:11%3C615::AID-CPE447%3E3.0.CO;2-H

George R. Ribeiro-Justo, T. Delaitre, M. Zemerly, S. Winter

Behavioural and performance analysis is a fundamental problem in the development of parallel (and distributed) programs. To address this problem, models and supporting environments are required to enable designers to build and analyse their programs. The model we put forward in this paper combines graphical and textual representations of the program structure and uses discrete-event simulation for performance and behaviour predictions. A graphical environment supports our model, providing, amongst other features, a graphical editor, a simulation engine and a performance and behaviour visualisation tool. A number of case studies using this environment are also provided for illustration and validation of our model. In comparisons of real execution against simulation for the case studies, the observed prediction errors are within 10%.
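For readers unfamiliar with discrete-event simulation, the following is a minimal sketch of the general technique, not the environment described in the paper. It predicts completion times for tasks sharing one FIFO server; the function name `simulate`, the arrival times, and the service time are assumptions chosen for illustration.

```python
import heapq


def simulate(arrivals, service_time):
    """Discrete-event simulation of one FIFO server.
    arrivals: list of task arrival times; returns each task's completion time."""
    events = [(t, i, "arrive") for i, t in enumerate(arrivals)]
    heapq.heapify(events)                      # events are processed in time order
    server_free_at = 0.0
    done = {}
    while events:
        now, task, kind = heapq.heappop(events)
        if kind == "arrive":
            start = max(now, server_free_at)   # wait if the server is busy
            server_free_at = start + service_time
            heapq.heappush(events, (server_free_at, task, "finish"))
        else:
            done[task] = now
    return [done[i] for i in range(len(arrivals))]


completions = simulate([0.0, 1.0, 1.5], service_time=2.0)
```

The simulated clock jumps from one event to the next, so the cost of a run depends on the number of events rather than on the length of the simulated time span.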


Citations: 1

An analytical tool for predicting the performance of parallel relational databases

Concurr. Pract. Exp.

Pub Date : 1999-09-01 DOI: 10.1002/(SICI)1096-9128(199909)11:11%3C635::AID-CPE448%3E3.0.CO;2-C

M. H. Williams, E. Dempster, Neven Tomov, C. S. Pua, H. Taylor, A. Burger, J. Lü, P. Broughton

The uptake of parallel DBMSs is being hampered by uncertainty about the impact on performance of porting database applications from sequential to parallel systems. The development of tools which aid the system manager or machine vendor could help to reduce this problem. This paper describes an analytical tool which determines the performance characteristics (in terms of throughput, resource utilisation and response time) of relational database transactions executing on particular machine configurations and provides simple graphical visualisations of these to enable users to obtain rapid insight into particular scenarios. The problems of handling different parallel DBMSs are illustrated with reference to three systems – Ingres, Informix and Oracle. A brief description is also given of two different approaches used to confirm the validity of the analytical approach on which the tool is based. Copyright © 1999 John Wiley & Sons, Ltd.


Citations: 11

CRAFT: a framework for F90/HPF compiler optimizations

Concurr. Pract. Exp.

Pub Date : 1999-08-25 DOI: 10.1002/(SICI)1096-9128(19990825)11:10%3C529::AID-CPE439%3E3.0.CO;2-S

Jan-Jan Wu, Marina C. Chen, J. Cowie

In this paper, we give an overview of the results of the CRAFT optimising compiler project (Fortran 90/HPF subset compilers). We start by describing the theoretical framework within which we designed program transformations for the optimization of inter- and intraprocedural data motion, as well as the optimizations for parallel loops; we then describe the implementation of the CRAFT compilers for Thinking Machines’ CM-2 and CM-5. We report results from experiments on the Connection Machine CM-5, the IBM SP-2 and a network of UltraSparc workstations. The results demonstrate that these optimizations can achieve significant object code performance improvement. Copyright © 1999 John Wiley & Sons, Ltd.


Citations: 2

Parallel sensitivity analysis for DAEs with many parameters

Concurr. Pract. Exp.

Pub Date : 1999-08-25 DOI: 10.1002/(SICI)1096-9128(19990825)11:10%3C571::AID-CPE440%3E3.0.CO;2-%23

Wenjie Zhu, L. Petzold

In this paper, we discuss the parallel computation of the sensitivity analysis of systems of differential-algebraic equations (DAEs) with a moderate number of state variables and a large number of sensitivity parameters. Several parallel implementations based on DASSLSO are explored and their performance when using the Message Passing Interface (MPI) on an SGI Origin 2000 is compared. Copyright © 1999 John Wiley & Sons, Ltd.
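For context, the sensitivity systems referred to in the abstract have the following standard form. The notation ($F$, $y$, $p$, $s_i$, $n_p$) is assumed here and is not taken from the paper.

```latex
F(t, y, y', p) = 0, \qquad y(t_0) = y_0(p), \qquad p \in \mathbb{R}^{n_p}

\frac{\partial F}{\partial y}\, s_i + \frac{\partial F}{\partial y'}\, s_i' + \frac{\partial F}{\partial p_i} = 0,
\qquad s_i = \frac{\partial y}{\partial p_i}, \quad i = 1, \dots, n_p
```

Given the state $y$, each $s_i$ system is linear and involves the same Jacobian matrices, so the $n_p$ systems can in principle be divided among processors. Whether the implementations in the paper partition the work this way is not stated in the abstract.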

Citations: 9

Reproducible execution of SR programs

Concurr. Pract. Exp.

Pub Date : 1999-08-10 DOI: 10.1002/(SICI)1096-9128(19990810)11:9%3C479::AID-CPE441%3E3.0.CO;2-S

R. Olsson

Reproducing the execution of a concurrent program is important in debugging and testing. It requires that, regardless of the actual order in which processes may execute, the reproduced execution is identical, with respect to the order in which certain activities occur, to a previously recorded execution. This paper presents a solution to the reproducibility problem for programs written in the SR concurrent programming language. Our solution transforms an arbitrary SR program into one for recording an event sequence and one for replaying from an event sequence. SR provides a rich collection of synchronization mechanisms, including rendezvous, asynchronous message passing, remote procedure call, and dynamic process creation. SR language features allow: flexible invocation servicing (e.g., use of invocation parameters in selecting an invocation to service in message passing or rendezvous); dynamically created processes and resource (module) instances; dynamic communication paths between processes; and dynamic distribution of programs across multiple machines. Because of these features, adaptations of previous solutions to the reproducibility problem for other languages and notations do not work for SR. Our solution handles all the above features. It results in a naturally distributed control algorithm for programs that are distributed. This paper also describes the implementations of our transformation tools.
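The record and replay pairing described in the abstract can be sketched in a few lines. This is a Python stand-in using threads, not SR and not the paper's transformation tools; the class names `Recorder` and `Replayer`, the `event` method, and the worker names are assumptions made for this example. In record mode the order in which threads reach a synchronisation event is logged; in replay mode each thread waits until it is next in the logged sequence.

```python
import threading


class Recorder:
    """Record mode: log the order in which threads reach a synchronisation event."""
    def __init__(self):
        self.lock = threading.Lock()
        self.sequence = []

    def event(self, name, action):
        with self.lock:
            self.sequence.append(name)
            action()


class Replayer:
    """Replay mode: each thread waits until it is next in the recorded sequence."""
    def __init__(self, sequence):
        self.sequence = list(sequence)
        self.position = 0
        self.cond = threading.Condition()

    def _my_turn(self, name):
        return (self.position < len(self.sequence)
                and self.sequence[self.position] == name)

    def event(self, name, action):
        with self.cond:
            self.cond.wait_for(lambda: self._my_turn(name))
            action()
            self.position += 1
            self.cond.notify_all()


def run(controller, names=("A", "B", "C"), steps=2):
    """Each worker appends its name to a shared trace at every event."""
    trace = []

    def worker(name):
        for _ in range(steps):
            controller.event(name, lambda: trace.append(name))

    threads = [threading.Thread(target=worker, args=(n,)) for n in names]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return trace


recorder = Recorder()
recorded = run(recorder)                       # order depends on the scheduler
replayed = run(Replayer(recorder.sequence))    # order forced to match the recording
```

The replay controller here is centralised; the abstract notes that the paper's solution instead yields a naturally distributed control algorithm for distributed programs.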



Citations: 3

Predicting the execution time of message passing models

Concurr. Pract. Exp.

Pub Date : 1999-08-10 DOI: 10.1002/(SICI)1096-9128(19990810)11:9%3C461::AID-CPE436%3E3.0.CO;2-2

J. R. García, C. Rodríguez, Daniel González-Morales, F. Almeida

Citations: 19

Usefulness of adaptive load sharing for parallel processing on networks of workstations

Concurr. Pract. Exp.

Pub Date : 1999-07-01 DOI: 10.1002/(SICI)1096-9128(199907)11:8%3C387::AID-CPE432%3E3.0.CO;2-4

S. Clarke, S. Dandamudi

Networks of workstations (NOWs) can be used for parallel processing by using public domain software like PVM. However, NOW-based parallel processing suffers from node heterogeneity, background load variations, and a high-latency, low-bandwidth communication network. Previous studies on load sharing in NOW-based systems have indicated that, for applications using the work-pile model, a simple load sharing scheme in which the master process gives a fixed amount of work to the slave processes performs as well as any other, more complex scheme. In this paper, we propose a new adaptive load sharing scheme and evaluate its performance using a Pentium-based NOW machine. The communication network used in the system consists of the standard 10 Mbps Ethernet and the 100 Mbps fast Ethernet. We use both these networks to study their impact on the performance of our new policy. The results presented here indicate that the new policy is useful for computation-intensive applications. Copyright © 1999 John Wiley & Sons, Ltd.
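The work-pile model with a fixed chunk size can be sketched as a small simulation. The abstract does not describe the adaptive policy the paper proposes, so the decreasing-chunk rule below is a generic guided-style rule swapped in purely for comparison; the function name `work_pile`, the node speeds, and the chunk sizes are assumptions, and communication cost is ignored.

```python
import heapq


def work_pile(total_units, speeds, chunk_for):
    """Simulate a master handing out chunks on demand.
    speeds: work units per second for each worker (heterogeneous nodes).
    chunk_for(remaining) -> number of units to hand out next.
    Returns the makespan in seconds; communication cost is ignored."""
    idle = [(0.0, w) for w in range(len(speeds))]   # (time worker becomes idle, id)
    heapq.heapify(idle)
    remaining = total_units
    finish = 0.0
    while remaining > 0:
        now, w = heapq.heappop(idle)                # next worker to ask for work
        units = max(1, min(remaining, chunk_for(remaining)))
        remaining -= units
        done = now + units / speeds[w]
        finish = max(finish, done)
        heapq.heappush(idle, (done, w))
    return finish


speeds = [4.0, 1.0]                                     # assumed: one fast, one slow node
t_small = work_pile(100, speeds, lambda r: 10)          # fixed chunks of 10 units
t_large = work_pile(100, speeds, lambda r: 50)          # fixed chunks of 50 units
t_guided = work_pile(100, speeds, lambda r: r // 4)     # hypothetical decreasing chunks
```

In this toy setting the chunk size matters more than the sophistication of the rule: a large fixed chunk leaves the fast node idle while the slow node finishes, whereas small fixed chunks already reach the ideal time of 100 / (4 + 1) seconds.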


Citations: 1
