Towards Understanding Job Heterogeneity in HPC: A NERSC Case Study

2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid) Pub Date : 2016-05-16 DOI:10.1109/CCGrid.2016.32

G. P. R. Álvarez, Per-Olov Östberg, E. Elmroth, K. Antypas, R. Gerber, L. Ramakrishnan

引用次数: 18

Abstract

The high performance computing (HPC) scheduling landscape is changing. Increasingly, there are large scientific computations that include high-throughput, data-intensive, and stream-processing compute models. These jobs increase the workload heterogeneity, which presents challenges for classical tightly coupled MPI job oriented HPC schedulers. Thus, it is important to define new analyses methods to understand the heterogeneity of the workload, and its possible effect on the performance of current systems. In this paper, we present a methodology to assess the job heterogeneity in workloads and scheduling queues. We apply the method on the workloads of three current National Energy Research Scientific Computing Center (NERSC) systems in 2014. Finally, we present the results of such analysis, with an observation that heterogeneity might reduce predictability in the jobs' wait time.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

理解高性能计算中的工作异质性:NERSC案例研究

高性能计算(HPC)调度格局正在发生变化。越来越多的大型科学计算包括高吞吐量、数据密集型和流处理计算模型。这些作业增加了工作负载的异构性，这对传统的面向紧耦合MPI作业的HPC调度器提出了挑战。因此，定义新的分析方法来理解工作负载的异质性及其对当前系统性能的可能影响是很重要的。在本文中，我们提出了一种评估工作负载和调度队列中的作业异质性的方法。我们将该方法应用于2014年国家能源研究科学计算中心(NERSC)三个现有系统的工作负荷。最后，我们提出了这样的分析结果，并观察到异质性可能会降低工作等待时间的可预测性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid)

自引率

0.00%

发文量