Workload-Balanced Graph Attention Network Accelerator with Top-K Aggregation Candidates

2022 IEEE/ACM International Conference On Computer Aided Design (ICCAD) Pub Date : 2022-10-29 DOI:10.1145/3508352.3549343

Naebeom Park, Daehyun Ahn, Jae-Joon Kim

{"title":"Workload-Balanced Graph Attention Network Accelerator with Top-K Aggregation Candidates","authors":"Naebeom Park, Daehyun Ahn, Jae-Joon Kim","doi":"10.1145/3508352.3549343","DOIUrl":null,"url":null,"abstract":"Graph attention networks (GATs) are gaining attention for various transductive and inductive graph processing tasks due to their higher accuracy than conventional graph convolutional networks (GCNs). The power-law distribution of real-world graph-structured data, on the other hand, causes a severe workload imbalance problem for GAT accelerators. To reduce the degradation of PE utilization due to the workload imbalance, we present algorithm/hardware co-design results for a GAT accelerator that balances workload assigned to processing elements by allowing only K neighbor nodes to participate in aggregation phase. The proposed model selects the K neighbor nodes with high attention scores, which represent relevance between two nodes, to minimize accuracy drop. Experimental results show that our algorithm/hardware co-design of the GAT accelerator achieves higher processing speed and energy eﬃciency than the GAT accelerators using conventional workload balancing techniques. Furthermore, we demonstrate that the proposed GAT accelerators can be made faster than the GCN accelerators that typically process smaller number of computations.","PeriodicalId":270592,"journal":{"name":"2022 IEEE/ACM International Conference On Computer Aided Design (ICCAD)","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM International Conference On Computer Aided Design (ICCAD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3508352.3549343","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Graph attention networks (GATs) are gaining attention for various transductive and inductive graph processing tasks due to their higher accuracy than conventional graph convolutional networks (GCNs). The power-law distribution of real-world graph-structured data, on the other hand, causes a severe workload imbalance problem for GAT accelerators. To reduce the degradation of PE utilization due to the workload imbalance, we present algorithm/hardware co-design results for a GAT accelerator that balances workload assigned to processing elements by allowing only K neighbor nodes to participate in aggregation phase. The proposed model selects the K neighbor nodes with high attention scores, which represent relevance between two nodes, to minimize accuracy drop. Experimental results show that our algorithm/hardware co-design of the GAT accelerator achieves higher processing speed and energy eﬃciency than the GAT accelerators using conventional workload balancing techniques. Furthermore, we demonstrate that the proposed GAT accelerators can be made faster than the GCN accelerators that typically process smaller number of computations.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

具有Top-K聚合候选者的工作负载平衡图注意力网络加速器

图注意网络(GATs)由于其比传统的图卷积网络(GCNs)具有更高的精度，在各种换向和归纳图处理任务中越来越受到关注。另一方面，实际图结构数据的幂律分布会导致GAT加速器出现严重的工作负载不平衡问题。为了减少由于工作负载不平衡而导致的PE利用率下降，我们提出了一种GAT加速器的算法/硬件协同设计结果，该加速器通过只允许K个邻居节点参与聚合阶段来平衡分配给处理元素的工作负载。该模型选择了K个具有高关注分数的相邻节点，这些节点代表了两个节点之间的相关性，以最小化准确率下降。实验结果表明，我们的算法/硬件协同设计的GAT加速器比使用传统工作负载平衡技术的GAT加速器具有更高的处理速度和能源效率。此外，我们证明了所提出的GAT加速器可以比通常处理较少计算量的GCN加速器更快。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2022 IEEE/ACM International Conference On Computer Aided Design (ICCAD)

自引率

0.00%

发文量