Pub Date: 1994-12-01 | DOI: 10.1109/88.345962
Dedos: a distributed real-time environment
D. Hammer, E. Luit, O. V. Roosmalen, P. V. D. Stok, J. Verhoosel
Until now, little research has been done on methods to combine the seemingly incompatible paradigms of hard and soft real-time systems. To address this, we have developed Dedos, a dependable distributed operating system. The driving forces behind the project are twofold: to meet the demand for dependable distributed control systems, especially in the area of embedded systems and industrial control; and to increase the productivity and quality of application programming for distributed control. Our current focus is on hard real-time issues; soft real-time needs are handled by conventional techniques. However, our work has raised interesting questions about the communication between the soft and hard real-time tasks of the system, which is necessary to pass externally specified control parameters and control status information. The problem is that the data set must always be consistent (concurrency atomicity), but hard real-time activities can never be delayed by soft real-time ones. Other intriguing questions concern the integration of the reliability and security concepts used in the two parts of the system. In this paper, however, we limit our discussion to the Dedos development model, the Dedos programming model, hard real-time scheduling, and the distributed algorithms needed to implement the Dedos execution environment.
Pub Date: 1994-09-01 | DOI: 10.1109/M-PDT.1994.329791
Task Parallelism in a High Performance Fortran Framework
T. Gross, D. O'Hallaron, J. Subhlok
Exploiting both data and task parallelism in a single framework is the key to achieving good performance for a variety of applications.
{"title":"Task Parallelism in a High Performance Fortran Framework","authors":"T. Gross, D. O'Hallaron, J. Subhlok","doi":"10.1109/M-PDT.1994.329791","DOIUrl":"https://doi.org/10.1109/M-PDT.1994.329791","url":null,"abstract":"Exploiting both data and task parallelism in a single framework is the key to achieving good performance for a variety of applications.","PeriodicalId":325213,"journal":{"name":"IEEE Parallel & Distributed Technology: Systems & Applications","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130029812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 1994-09-01 | DOI: 10.1109/M-PDT.1994.329803
Extending HPF for Advanced Data-Parallel Applications
B. Chapman, H. Zima, P. Mehrotra
High Performance Fortran can support regular numerical algorithms, but it cannot adequately express advanced applications such as particle-in-cell codes or unstructured mesh solvers. This article addresses this problem and outlines possible development paths.
{"title":"Extending HPF for Advanced Data-Parallel Applications","authors":"B. Chapman, H. Zima, P. Mehrotra","doi":"10.1109/M-PDT.1994.329803","DOIUrl":"https://doi.org/10.1109/M-PDT.1994.329803","url":null,"abstract":"High Performance Fortran can support regular numerical algorithms, but it cannot adequately express advanced applications such as particle-in-cell codes or unstructured mesh solvers.This article addresses this problem and outlines possible development paths.","PeriodicalId":325213,"journal":{"name":"IEEE Parallel & Distributed Technology: Systems & Applications","volume":"70 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117182971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 1994-09-01 | DOI: 10.1109/M-PDT.1994.329794
Task Parallelism and High-Performance Languages
Ian T. Foster
The definition of High Performance Fortran (HPF) is a significant event in the maturation of parallel computing: it represents the first parallel language that has gained widespread support from vendors and users. This paper examines how to incorporate support for task parallelism into such a language. The term task parallelism refers to the explicit creation of multiple threads of control, or tasks, which synchronize and communicate under programmer control. Task and data parallelism are complementary rather than competing programming models. While task parallelism is more general and can be used to implement algorithms that are not amenable to data-parallel solutions, many problems can benefit from a mixed approach, with, for example, a task-parallel coordination layer integrating multiple data-parallel computations. Other problems admit both data- and task-parallel solutions, with the better choice depending on machine characteristics, compiler performance, or personal taste. For these reasons, we believe that a general-purpose high-performance language should integrate both task- and data-parallel constructs. The challenge is to do so in a way that provides the expressivity needed for applications while preserving the flexibility and portability of a high-level language. In this paper, we examine and illustrate the considerations that motivate the use of task parallelism. We also describe one particular approach to task parallelism in Fortran, namely the Fortran M extensions. Finally, we contrast Fortran M with other proposed approaches and discuss the implications of this work for task parallelism and high-performance languages.
{"title":"Task Parallelism and High-Performance Languages","authors":"Ian T Foster","doi":"10.1109/M-PDT.1994.329794","DOIUrl":"https://doi.org/10.1109/M-PDT.1994.329794","url":null,"abstract":"The definition of High Performance Fortran (HPF) is a significant event in the maturation of parallel computing: it represents the first parallel language that has gained widespread support from vendors and users. The subject of this paper is to incorporate support for task parallelism. The term task parallelism refers to the explicit creation of multiple threads of control, or tasks, which synchronize and communicate under programmer control. Task and data parallelism are complementary rather than competing programming models. While task parallelism is more general and can be used to implement algorithms that are not amenable to data-parallel solutions, many problems can benefit from a mixed approach, with for example a task-parallel coordination layer integrating multiple data-parallel computations. Other problems admit to both data- and task-parallel solutions, with the better solution depending on machine characteristics, compiler performance, or personal taste. For these reasons, we believe that a general-purpose high-performance language should integrate both task- and data-parallel constructs. The challenge is to do so in a way that provides the expressivity needed for applications, while preserving the flexibility and portability of a high-level language. In this paper, we examine and illustrate the considerations that motivate the use of task parallelism. We also describe one particular approach to task parallelism in Fortran, namely the Fortran M extensions. Finally, we contrast Fortran M with other proposed approaches and discuss the implications of this work for task parallelism and high-performance languages.","PeriodicalId":325213,"journal":{"name":"IEEE Parallel & Distributed Technology: Systems & Applications","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115934664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 1994-09-01 | DOI: 10.1109/M-PDT.1994.329801
Requirements for Data-Parallel Programming Environments
Vikram S. Adve, A. Carle, Elana D. Granston, S. Hiranandani, K. Kennedy, C. Koelbel, U. Kremer, J. Mellor-Crummey, S. Warren, C. Tseng
An effective data-parallel programming environment will use a variety of tools that support the development of efficient data-parallel programs while insulating the programmer from the intricacies of the explicitly parallel code.
{"title":"Requirements for DataParallel Programming Environments","authors":"Vikram S. Adve, A. Carle, Elana D. Granston, S. Hiranandani, K. Kennedy, C. Koelbel, U. Kremer, J. Mellor-Crummey, S. Warren, C. Tseng","doi":"10.1109/M-PDT.1994.329801","DOIUrl":"https://doi.org/10.1109/M-PDT.1994.329801","url":null,"abstract":"An effective data-parallel programming environment will use a variety of tools that support the development of efficient data-parallel programs while insulating the programmer from the intricacies of the explicitly parallel code.","PeriodicalId":325213,"journal":{"name":"IEEE Parallel & Distributed Technology: Systems & Applications","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122647401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 1994-09-01 | DOI: 10.1109/M-PDT.1994.329796
Automatic Detection of Parallelism: A grand challenge for high performance computing
W. Blume, R. Eigenmann, J. Hoeflinger, D. Padua, Paul Petersen, Lawrence Rauchwerger, P. Tu
The limited ability of compilers to find the parallelism in programs is a significant barrier to the use of high-performance computers. However, a combination of static and runtime techniques can improve compilers to the extent that a significant group of scientific programs can be parallelized automatically.
{"title":"Automatic Detection of Parallelism: A grand challenge for high performance computing","authors":"W. Blume, R. Eigenmann, J. Hoeflinger, D. Padua, Paul Petersen, Lawrence Rauchwerger, P. Tu","doi":"10.1109/M-PDT.1994.329796","DOIUrl":"https://doi.org/10.1109/M-PDT.1994.329796","url":null,"abstract":"The limited ability of compilers to find the parallelism in programs is a significant barrier to the use of high-performance computers.However, a combination of static and runtime techniques can improve compilers to the extent that a significant group of scientific programs can be parallelized automatically.","PeriodicalId":325213,"journal":{"name":"IEEE Parallel & Distributed Technology: Systems & Applications","volume":"86 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1994-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123532905","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date: 1994-06-01 | DOI: 10.1109/88.311568
Exploiting data coherence to improve parallel volume rendering
P. Mackerras, B. Corrie
We have implemented a parallel volume renderer that successfully manages work and data distribution by exploiting data coherence (the tendency of neighboring pixels to use the same data during rendering, particularly when rendering volume data). This flexible, powerful renderer uses ray-casting on a Fujitsu AP1000 to generate high-quality images of volume data sets with other geometrically defined structures, such as a set of coordinate axes or a world map. This article focuses on our schemes for work and data distribution. Using image-space work distribution to partition a 2D image among processing nodes, and distributed virtual memory to assign 3D volume data, this renderer effectively and efficiently parallelizes volume rendering.
Pub Date: 1994-06-01 | DOI: 10.1109/88.311573
Achieving superlinear speedup on a heterogeneous, distributed system
C. Mechoso, J. Farrara, J. A. Spahr
The CASA Gigabit Network Testbed, part of NSF and ARPA's Gigabit Project, is investigating whether a metacomputer consisting of widely distributed, heterogeneous supercomputers connected by a high-speed network is viable for large scientific applications. A particular challenge is to determine whether such a metacomputer can produce superlinear speedup despite latency and communication overheads. One of the applications in the CASA testbed is a model we developed that couples a global atmosphere model to a world ocean model. Simulations using such coupled general circulation models for climate studies demand considerable computer resources. When distributing such a model, we need to consider the methods for masking latency with computation, the communications bandwidth requirements for different decomposition strategies, the optimal computer architecture for each major phase of the computation, and the effects of latency and communication costs for different decomposition strategies. Here we focus on the last two issues, and demonstrate that choosing the appropriate computer architectures and masking communication with computation can produce superlinear speedup.
Pub Date: 1994-06-01 | DOI: 10.1109/88.311574
ickp: a consistent checkpointer for multicomputers
J. Plank, Kai Li
There has been much research on checkpointing algorithms for parallel and distributed systems, but surprisingly few implementations for uniprocessors, multiprocessors, and distributed systems, and none at all for multicomputers. We discuss ickp, our consistent checkpointer for the Intel iPSC/860, which is the first general-purpose checkpointer for a multicomputer. It is a checkpointing library that may be invoked asynchronously from the host processor, at a periodic interval, or by a library call. It implements three consistent checkpointing algorithms, two optimizations to reduce checkpoint time and overhead, and recovery.
Pub Date: 1994-06-01 | DOI: 10.1109/88.311569
Parallel polygon rendering for message-passing architectures
T. Crockett, T. Orloff
Applications such as real-time animation and scientific visualization demand high performance for rendering complex 3D abstract data models into 2D images. As large applications migrate to highly parallel supercomputers, how can we exploit the available parallelism to keep the rendering on the supercomputer? To answer this question, we developed a parallel polygon renderer for general-purpose MIMD distributed-memory message-passing systems. It exploits object-level and image-level parallelism, and can run on systems containing from one processor to a number bounded by the number of scan lines in the resulting image. Unlike earlier approaches, ours multiplexes the transformation and rasterization phases on the same machine. This reduces memory usage and network contention, and overlaps computation and communication.