首页 > 最新文献

Programming Models for Massively Parallel Computers最新文献

英文 中文
Comparing scalable programming techniques for weather prediction 比较可扩展的天气预报编程技术
Pub Date : 1995-10-09 DOI: 10.1109/PMMPC.1995.504348
B. Rodriguez, L. Hart, T. Henderson
In this paper we study the of issues of programmability and performance in the parallelization of weather prediction models. We compare parallelization using a high level library (the Nearest Neighbor Tool: NNT) and a high level language/directive approach (High Performance Fortran: HPF). We report on the performance of a complete weather prediction model (the Rapid Update Cycle, which is currently run operationally at the National Meteorological Center at Washington) coded using NNT. We quantify the performance effects of optimizations possible with NNT that must be performed by an HPF compiler.
本文研究了天气预报模型并行化中的可编程性和性能问题。我们比较了使用高级库(最近邻工具:NNT)和高级语言/指令方法(高性能Fortran: HPF)的并行化。我们报告了使用NNT编码的完整天气预报模型(快速更新周期,目前在华盛顿国家气象中心运行)的性能。我们量化了必须由HPF编译器执行的NNT可能的优化的性能影响。
{"title":"Comparing scalable programming techniques for weather prediction","authors":"B. Rodriguez, L. Hart, T. Henderson","doi":"10.1109/PMMPC.1995.504348","DOIUrl":"https://doi.org/10.1109/PMMPC.1995.504348","url":null,"abstract":"In this paper we study the of issues of programmability and performance in the parallelization of weather prediction models. We compare parallelization using a high level library (the Nearest Neighbor Tool: NNT) and a high level language/directive approach (High Performance Fortran: HPF). We report on the performance of a complete weather prediction model (the Rapid Update Cycle, which is currently run operationally at the National Meteorological Center at Washington) coded using NNT. We quantify the performance effects of optimizations possible with NNT that must be performed by an HPF compiler.","PeriodicalId":344246,"journal":{"name":"Programming Models for Massively Parallel Computers","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123102365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Optimizing atomic functions using compile-time information 使用编译时信息优化原子函数
Pub Date : 1995-10-09 DOI: 10.1109/PMMPC.1995.504343
T. Ruhl, H. Bal
Functions that invoke operations on multiple objects atomically are a useful extension of object-based parallel languages, such as Orca. This paper introduces atomic functions and shows how compile-time information can drive run-time optimizations of such functions.
自动调用多个对象上的操作的函数是基于对象的并行语言(如Orca)的有用扩展。本文介绍了原子函数,并展示了编译时信息如何驱动这些函数的运行时优化。
{"title":"Optimizing atomic functions using compile-time information","authors":"T. Ruhl, H. Bal","doi":"10.1109/PMMPC.1995.504343","DOIUrl":"https://doi.org/10.1109/PMMPC.1995.504343","url":null,"abstract":"Functions that invoke operations on multiple objects atomically are a useful extension of object-based parallel languages, such as Orca. This paper introduces atomic functions and shows how compile-time information can drive run-time optimizations of such functions.","PeriodicalId":344246,"journal":{"name":"Programming Models for Massively Parallel Computers","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131512150","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
V-nested parallelism in C C语言中的v嵌套并行
Pub Date : 1995-10-09 DOI: 10.1109/PMMPC.1995.504355
M. Chakravarty, F. Schroer, M. Simons
This paper describes the integration of nested data parallelism into imperative languages using the example of C. Unlike flat data parallelism, nested data parallelism directly provides means for handling irregular data structures and certain forms of control parallelism, such as divide-and-conquer algorithms, thus enabling the programmer to express such algorithms far more naturally. Existing work deals with nested data parallelism in a functional environment, which does help avoid a set of problems, but makes efficient implementations more complicated. Moreover functional languages are not readily accepted by programmers used to languages, such as Fortran and C, which are currently predominant in programming parallel machines. In this paper, we introduce the imperative data-parallel language V and give an overview of its implementation.
本文以c语言为例,描述了将嵌套数据并行集成到命使式语言中的方法。与平面数据并行不同,嵌套数据并行直接提供了处理不规则数据结构和某些形式的控制并行的方法,例如分治算法,从而使程序员能够更自然地表达这些算法。现有工作处理功能环境中的嵌套数据并行性,这确实有助于避免一组问题,但使高效实现变得更加复杂。此外,函数式语言不容易被习惯于使用Fortran和C等语言的程序员所接受,而这些语言目前在并行机器编程中占主导地位。本文介绍了命令式数据并行语言V,并对其实现进行了概述。
{"title":"V-nested parallelism in C","authors":"M. Chakravarty, F. Schroer, M. Simons","doi":"10.1109/PMMPC.1995.504355","DOIUrl":"https://doi.org/10.1109/PMMPC.1995.504355","url":null,"abstract":"This paper describes the integration of nested data parallelism into imperative languages using the example of C. Unlike flat data parallelism, nested data parallelism directly provides means for handling irregular data structures and certain forms of control parallelism, such as divide-and-conquer algorithms, thus enabling the programmer to express such algorithms far more naturally. Existing work deals with nested data parallelism in a functional environment, which does help avoid a set of problems, but makes efficient implementations more complicated. Moreover functional languages are not readily accepted by programmers used to languages, such as Fortran and C, which are currently predominant in programming parallel machines. In this paper, we introduce the imperative data-parallel language V and give an overview of its implementation.","PeriodicalId":344246,"journal":{"name":"Programming Models for Massively Parallel Computers","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129448705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Automatic mapping of stream-processing functional programs 流处理功能程序的自动映射
Pub Date : 1995-10-09 DOI: 10.1109/PMMPC.1995.504340
J. Dennis
Functional programming languages are well suited to the expression and automatic mapping of parallel computations. The Paradigm compiler is being developed to automatically analyze and snap a class of Sisal programs amenable to static analysis for execution by distributed computer systems. A program description tree is constructed to represent the source program. In this form, the program is transformed into a hierarchy of acyclic interconnections of program modules (code blocks) of two basic types: array generators and stream producers. The program description tree is then used to guide decisions about allocation of processing elements to code blocks, and to construct of code for a target multiprocessor. We discuss the problem of finding an optimal allocation (mapping), and illustrate the methodology using a practical signal processing example.
函数式编程语言非常适合并行计算的表达式和自动映射。开发Paradigm编译器是为了自动分析和捕捉一类适合于静态分析的Sisal程序,以便由分布式计算机系统执行。构造一个程序描述树来表示源程序。在这种形式下,程序被转换为两种基本类型的程序模块(代码块)的无循环互连的层次结构:数组生成器和流生成器。然后,程序描述树用于指导有关将处理元素分配到代码块的决策,以及为目标多处理器构造代码。我们讨论了寻找最优分配(映射)的问题,并通过一个实际的信号处理例子说明了该方法。
{"title":"Automatic mapping of stream-processing functional programs","authors":"J. Dennis","doi":"10.1109/PMMPC.1995.504340","DOIUrl":"https://doi.org/10.1109/PMMPC.1995.504340","url":null,"abstract":"Functional programming languages are well suited to the expression and automatic mapping of parallel computations. The Paradigm compiler is being developed to automatically analyze and snap a class of Sisal programs amenable to static analysis for execution by distributed computer systems. A program description tree is constructed to represent the source program. In this form, the program is transformed into a hierarchy of acyclic interconnections of program modules (code blocks) of two basic types: array generators and stream producers. The program description tree is then used to guide decisions about allocation of processing elements to code blocks, and to construct of code for a target multiprocessor. We discuss the problem of finding an optimal allocation (mapping), and illustrate the methodology using a practical signal processing example.","PeriodicalId":344246,"journal":{"name":"Programming Models for Massively Parallel Computers","volume":"30 6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122474706","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Irregular applications in PROMOTER 在PROMOTER中的不规则应用
Pub Date : 1995-10-09 DOI: 10.1109/PMMPC.1995.504354
A. Schramm
Parallel computers with distributed memory are gaining popularity on account of their optimal scalability. However, their efficient use requires a locality-preserving mapping of the application's underlying graph structure onto the physical topology of the target platform. PROMOTER is a parallel programming model which supports an automatic mapping by the compiler by making the graph structures explicit and thus processable by the implementation. This article describes how this is done for applications with irregular and dynamic spatial structures.
具有分布式内存的并行计算机由于其最佳的可伸缩性而越来越受欢迎。然而,它们的有效使用需要应用程序的底层图结构到目标平台物理拓扑的位置保持映射。PROMOTER是一种并行编程模型,它通过使图结构显式并因此可被实现处理来支持编译器的自动映射。本文描述了如何为具有不规则和动态空间结构的应用程序执行此操作。
{"title":"Irregular applications in PROMOTER","authors":"A. Schramm","doi":"10.1109/PMMPC.1995.504354","DOIUrl":"https://doi.org/10.1109/PMMPC.1995.504354","url":null,"abstract":"Parallel computers with distributed memory are gaining popularity on account of their optimal scalability. However, their efficient use requires a locality-preserving mapping of the application's underlying graph structure onto the physical topology of the target platform. PROMOTER is a parallel programming model which supports an automatic mapping by the compiler by making the graph structures explicit and thus processable by the implementation. This article describes how this is done for applications with irregular and dynamic spatial structures.","PeriodicalId":344246,"journal":{"name":"Programming Models for Massively Parallel Computers","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127350068","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
A taxonomy of programming models for symmetric multiprocessors and SMP clusters 对称多处理器和SMP集群的编程模型分类
Pub Date : 1995-10-09 DOI: 10.1109/PMMPC.1995.504335
W. Gropp, E. Lusk
The basic processing element, from PCs to large systems, is rapidly becoming a symmetric multiprocessor (SMP). As a result, the nodes of a parallel computer will often be an SMP. The resulting mixed hardware models (combining shared-memory and distributed memory) provide a challenge to system software developers to provide users with programming models that are portable, understandable, and efficient. This paper describes and compares a variety of programming models for a parallel computer made up of SMP nodes.
从个人电脑到大型系统的基本处理元素正迅速成为对称多处理器(SMP)。因此,并行计算机的节点通常是一个SMP。由此产生的混合硬件模型(结合共享内存和分布式内存)向系统软件开发人员提出了一个挑战,即向用户提供可移植、可理解和高效的编程模型。本文描述并比较了由SMP节点组成的并行计算机的各种编程模型。
{"title":"A taxonomy of programming models for symmetric multiprocessors and SMP clusters","authors":"W. Gropp, E. Lusk","doi":"10.1109/PMMPC.1995.504335","DOIUrl":"https://doi.org/10.1109/PMMPC.1995.504335","url":null,"abstract":"The basic processing element, from PCs to large systems, is rapidly becoming a symmetric multiprocessor (SMP). As a result, the nodes of a parallel computer will often be an SMP. The resulting mixed hardware models (combining shared-memory and distributed memory) provide a challenge to system software developers to provide users with programming models that are portable, understandable, and efficient. This paper describes and compares a variety of programming models for a parallel computer made up of SMP nodes.","PeriodicalId":344246,"journal":{"name":"Programming Models for Massively Parallel Computers","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116836656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 41
期刊
Programming Models for Massively Parallel Computers
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1