Thread mapping using system-level throughput prediction model for shared memory multicores

2014 IEEE 33rd International Performance Computing and Communications Conference (IPCCC) Pub Date : 2014-12-01 DOI:10.1109/PCCC.2014.7017045

Reshmi Mitra, B. Joshi, R. Adams

引用次数: 0

Abstract

The primary purpose of the current paper is to design a fast and accurate performance model framework for exploring various thread-to-core mapping strategies (MS) and estimating steady state cycles per instruction (CPI). It is directed towards efficiently exploring these performance metrics for large parallel applications for shared memory multicores. This work establishes a hybrid Markov Chain Model (MCM) and Model Tree (MT) based system-level performance prediction model framework. The model is validated with an Electromagnetics application for 12 different mapping strategies. The average performance prediction error is 0.168% with standard deviation of 3.866%. The total run time of model is of the order of minutes, whereas the actual application execution time is in terms of several days.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用系统级吞吐量预测模型的线程映射用于共享内存多核

本文的主要目的是设计一个快速准确的性能模型框架，用于探索各种线程到核映射策略(MS)和估计每指令稳态周期(CPI)。它旨在为共享内存多核的大型并行应用程序有效地探索这些性能指标。本文建立了一个基于混合马尔可夫链模型和模型树的系统级性能预测模型框架。在电磁学应用中对12种不同的映射策略进行了验证。平均性能预测误差为0.168%，标准差为3.866%。模型的总运行时间大约是几分钟，而实际的应用程序执行时间是几天。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 IEEE 33rd International Performance Computing and Communications Conference (IPCCC)

自引率

0.00%

发文量