Characterizing On-Chip Traffic Patterns in General-Purpose GPUs: A Deep Learning Approach

2019 IEEE 37th International Conference on Computer Design (ICCD) Pub Date : 2019-11-01 DOI:10.1109/ICCD46524.2019.00016

Yunfan Li, Drew Penney, Abhishek Ramamurthy, Lizhong Chen

{"title":"Characterizing On-Chip Traffic Patterns in General-Purpose GPUs: A Deep Learning Approach","authors":"Yunfan Li, Drew Penney, Abhishek Ramamurthy, Lizhong Chen","doi":"10.1109/ICCD46524.2019.00016","DOIUrl":null,"url":null,"abstract":"Architectural optimizations in general-purpose graphics processing units (GPGPUs) often exploit workload characteristics to reduce power and latency while improving performance. This paper finds, however, that prevailing assumptions about GPGPU traffic pattern characterization are inaccurate. These assumptions must therefore be re-evaluated, and more appropriate new patterns must be identified. This paper proposes a methodology to classify GPGPU traffic patterns, combining a convolutional neural network (CNN) for feature extraction and a t-distributed stochastic neighbor embedding (t-SNE) algorithm to determine traffic pattern clusters. A traffic pattern dataset is generated from common GPGPU benchmarks, transformed using heat mapping, and iteratively refined to ensure appropriate and highly accurate labels. The proposed classification model achieves 98.8% validation accuracy and 94.24% test accuracy. Furthermore, traffic in 96.6% of examined kernels can be classified into the eight identified traffic pattern categories.","PeriodicalId":6698,"journal":{"name":"2019 IEEE 37th International Conference on Computer Design (ICCD)","volume":"57 1","pages":"56-64"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 IEEE 37th International Conference on Computer Design (ICCD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCD46524.2019.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Architectural optimizations in general-purpose graphics processing units (GPGPUs) often exploit workload characteristics to reduce power and latency while improving performance. This paper finds, however, that prevailing assumptions about GPGPU traffic pattern characterization are inaccurate. These assumptions must therefore be re-evaluated, and more appropriate new patterns must be identified. This paper proposes a methodology to classify GPGPU traffic patterns, combining a convolutional neural network (CNN) for feature extraction and a t-distributed stochastic neighbor embedding (t-SNE) algorithm to determine traffic pattern clusters. A traffic pattern dataset is generated from common GPGPU benchmarks, transformed using heat mapping, and iteratively refined to ensure appropriate and highly accurate labels. The proposed classification model achieves 98.8% validation accuracy and 94.24% test accuracy. Furthermore, traffic in 96.6% of examined kernels can be classified into the eight identified traffic pattern categories.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通用gpu的片上流量模式表征:一种深度学习方法

通用图形处理单元(gpgpu)中的体系结构优化通常利用工作负载特性来降低功耗和延迟，同时提高性能。然而，本文发现，关于GPGPU流量模式表征的普遍假设是不准确的。因此，必须重新评估这些假设，并确定更适当的新模式。本文提出了一种GPGPU流量模式分类方法，结合卷积神经网络(CNN)特征提取和t分布随机邻居嵌入(t-SNE)算法确定流量模式聚类。流量模式数据集由通用GPGPU基准生成，使用热映射进行转换，并迭代改进以确保适当和高度准确的标签。该分类模型的验证准确率为98.8%，测试准确率为94.24%。此外，96.6%的检测核的流量可以被划分为8种识别的流量模式类别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 IEEE 37th International Conference on Computer Design (ICCD)

自引率

0.00%

发文量