P4: Phase-based power/performance prediction of heterogeneous systems via neural networks

2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) Pub Date : 2017-11-13 DOI:10.1109/ICCAD.2017.8203843

Yeseong Kim, Pietro Mercati, A. More, Emily J. Shriver, T. Simunic

{"title":"P4: Phase-based power/performance prediction of heterogeneous systems via neural networks","authors":"Yeseong Kim, Pietro Mercati, A. More, Emily J. Shriver, T. Simunic","doi":"10.1109/ICCAD.2017.8203843","DOIUrl":null,"url":null,"abstract":"The emergence of Internet of Things increases the complexity and the heterogeneity of computing platforms. Migrating workload between various platforms is one way to improve both energy efficiency and performance. Effective migration decisions require accurate estimates of its costs and benefits. To date, these estimates were done by either instrumenting the source code/binaries, thus causing high overhead, or by using power estimates from hardware performance counters, which work well for individual machines, but until now have not been accurate for predicting across different architectures. In this paper, we propose P4, a new Phase-based Power and Performance Prediction framework which identifies cross-platform application power and performance at runtime for heterogeneous computing systems. P4 analyzes and detects machine-independent application phases by characterizing computing platforms offline with a set of benchmarks, and then builds neural network-based models to automatically identify and generalize the complex cross-platform relationships for each benchmark phase. It then leverages these models along with performance counter measurements collected at runtime to estimate performance and power consumption if it were running on a completely different computing platform, including a different CPU architecture, without ever having to run it on there. We evaluate the proposed framework on four commercial heterogeneous platforms, ranging from X86 servers to mobile ARM-based architecture, with 129 industry-standard benchmarks. Our experimental results show that P4 can predict the power and performance changes with only 6.8% and 5.6% error, respectively, even for completely different architectures from the ones applications ran on.","PeriodicalId":126686,"journal":{"name":"2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCAD.2017.8203843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 16

Abstract

The emergence of Internet of Things increases the complexity and the heterogeneity of computing platforms. Migrating workload between various platforms is one way to improve both energy efficiency and performance. Effective migration decisions require accurate estimates of its costs and benefits. To date, these estimates were done by either instrumenting the source code/binaries, thus causing high overhead, or by using power estimates from hardware performance counters, which work well for individual machines, but until now have not been accurate for predicting across different architectures. In this paper, we propose P4, a new Phase-based Power and Performance Prediction framework which identifies cross-platform application power and performance at runtime for heterogeneous computing systems. P4 analyzes and detects machine-independent application phases by characterizing computing platforms offline with a set of benchmarks, and then builds neural network-based models to automatically identify and generalize the complex cross-platform relationships for each benchmark phase. It then leverages these models along with performance counter measurements collected at runtime to estimate performance and power consumption if it were running on a completely different computing platform, including a different CPU architecture, without ever having to run it on there. We evaluate the proposed framework on four commercial heterogeneous platforms, ranging from X86 servers to mobile ARM-based architecture, with 129 industry-standard benchmarks. Our experimental results show that P4 can predict the power and performance changes with only 6.8% and 5.6% error, respectively, even for completely different architectures from the ones applications ran on.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

P4:基于相位的异构系统功率/性能预测

物联网的出现增加了计算平台的复杂性和异构性。在不同平台之间迁移工作负载是提高能源效率和性能的一种方法。有效的迁移决策需要对其成本和收益进行准确的估计。到目前为止，这些估计要么是通过检测源代码/二进制文件来完成的，这样会导致很高的开销，要么是通过使用硬件性能计数器的功率估计来完成的，这对于单个机器来说工作得很好，但到目前为止，对于跨不同体系结构的预测还不准确。在本文中，我们提出了P4，一个新的基于阶段的功率和性能预测框架，它在运行时识别异构计算系统的跨平台应用程序功率和性能。P4通过使用一组基准离线表征计算平台，分析和检测与机器无关的应用阶段，然后构建基于神经网络的模型，自动识别和概括每个基准阶段的复杂跨平台关系。然后，它利用这些模型以及在运行时收集的性能计数器测量来估计在完全不同的计算平台(包括不同的CPU架构)上运行时的性能和功耗，而不必在该平台上运行它。我们在四个商业异构平台上评估了提议的框架，从X86服务器到基于arm的移动架构，有129个行业标准基准。我们的实验结果表明，即使对于与应用程序运行的架构完全不同的架构，P4也可以分别以6.8%和5.6%的误差预测功耗和性能变化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2017 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

自引率

0.00%

发文量

期刊最新文献

Clepsydra: Modeling timing flows in hardware designs A case for low frequency single cycle multi hop NoCs for energy efficiency and high performance P4: Phase-based power/performance prediction of heterogeneous systems via neural networks Cyclist: Accelerating hardware development A coordinated synchronous and asynchronous parallel routing approach for FPGAs