Evaluating the performance of software distributed shared memory as a target for parallelizing compilers

Proceedings 11th International Parallel Processing Symposium Pub Date : 1997-04-01 DOI:10.1109/IPPS.1997.580943

A. Cox, S. Dwarkadas, Honghui Lu, W. Zwaenepoel

{"title":"Evaluating the performance of software distributed shared memory as a target for parallelizing compilers","authors":"A. Cox, S. Dwarkadas, Honghui Lu, W. Zwaenepoel","doi":"10.1109/IPPS.1997.580943","DOIUrl":null,"url":null,"abstract":"In this paper we evaluate the use of software distributed shared memory (DSM) on a message passing machine as the target for a parallelizing compiler. We compare this approach to compiler-generated message passing, hand-coded software DSM and hand-coded message passing. For this comparison, we use six applications: four that are regular and two that are irregular: Our results are gathered on an 8-node IBM SP/2 using the TreadMarks software DSM system. We use the APR shared-memory (SPF) compiler to generate the shared memory-programs and the APR XHPF compiler to generate message passing programs. The hand-coded message passing programs run with the IBM PVMe optimized message passing library. On the regular programs, both the compiler-generated and the hand-coded message passing outperform the SPF/TreadMarks combination: the compiler-generated message passing by 5.5% to 40%, and the hand-coded message passing by 7.5% to 49%. On the irregular programs, the SPF/TreadMarks combination outperforms the compiler-generated message passing by 38% and 89%, and only slightly underperforms the hand-coded message passing, differing by 4.4% and 16%. We also identify the factors that account for the performance differences, estimate their relative importance, and describe methods to improve the performance.","PeriodicalId":145892,"journal":{"name":"Proceedings 11th International Parallel Processing Symposium","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"44","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 11th International Parallel Processing Symposium","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPPS.1997.580943","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 44

Abstract

In this paper we evaluate the use of software distributed shared memory (DSM) on a message passing machine as the target for a parallelizing compiler. We compare this approach to compiler-generated message passing, hand-coded software DSM and hand-coded message passing. For this comparison, we use six applications: four that are regular and two that are irregular: Our results are gathered on an 8-node IBM SP/2 using the TreadMarks software DSM system. We use the APR shared-memory (SPF) compiler to generate the shared memory-programs and the APR XHPF compiler to generate message passing programs. The hand-coded message passing programs run with the IBM PVMe optimized message passing library. On the regular programs, both the compiler-generated and the hand-coded message passing outperform the SPF/TreadMarks combination: the compiler-generated message passing by 5.5% to 40%, and the hand-coded message passing by 7.5% to 49%. On the irregular programs, the SPF/TreadMarks combination outperforms the compiler-generated message passing by 38% and 89%, and only slightly underperforms the hand-coded message passing, differing by 4.4% and 16%. We also identify the factors that account for the performance differences, estimate their relative importance, and describe methods to improve the performance.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

评估软件分布式共享内存作为并行编译器目标的性能

在本文中，我们评估了在消息传递机上使用软件分布式共享内存(DSM)作为并行编译器的目标。我们将这种方法与编译器生成的消息传递、手工编码的软件DSM和手工编码的消息传递进行比较。为了进行比较，我们使用了六个应用程序:四个是规则的，两个是不规则的:我们的结果是使用TreadMarks软件DSM系统在8节点IBM SP/2上收集的。我们使用APR共享内存(SPF)编译器生成共享内存程序，使用APR XHPF编译器生成消息传递程序。手工编码的消息传递程序使用IBM PVMe优化的消息传递库运行。在常规程序中，编译器生成的消息传递和手工编码的消息传递都优于SPF/TreadMarks组合:编译器生成的消息传递比SPF/TreadMarks组合高5.5%到40%，手工编码的消息传递比SPF/TreadMarks组合高7.5%到49%。在不规则程序中，SPF/TreadMarks组合的性能比编译器生成的消息传递高出38%和89%，仅略低于手工编码的消息传递，相差4.4%和16%。我们还确定了导致性能差异的因素，估计了它们的相对重要性，并描述了提高性能的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings 11th International Parallel Processing Symposium

自引率

0.00%

发文量