SATO: spiking neural network acceleration via temporal-oriented dataflow and architecture

Proceedings of the 59th ACM/IEEE Design Automation Conference Pub Date : 2022-07-10 DOI:10.1145/3489517.3530592

Fangxin Liu, Wenbo Zhao, Zongwu Wang, Yongbiao Chen, Tao Yang, Zhezhi He, Xiaokang Yang, Li Jiang

{"title":"SATO: spiking neural network acceleration via temporal-oriented dataflow and architecture","authors":"Fangxin Liu, Wenbo Zhao, Zongwu Wang, Yongbiao Chen, Tao Yang, Zhezhi He, Xiaokang Yang, Li Jiang","doi":"10.1145/3489517.3530592","DOIUrl":null,"url":null,"abstract":"Event-driven spiking neural networks (SNNs) have shown great promise for being strikingly energy-efficient. SNN neurons integrate the spikes, accumulate the membrane potential, and fire output spike when the potential exceeds a threshold. Existing SNN accelerators, however, have to carry out such accumulation-comparison operation in serial. Repetitive spike generation at each time step not only increases latency as well as overall energy budget, but also incurs memory access overhead of fetching membrane potentials, both of which lessen the efficiency of SNN accelerators. Meanwhile, inherent highly sparse spikes of SNNs lead to imbalanced workloads among neurons that hurdle the utilization of processing elements (PEs). This paper proposes SATO, a temporal-parallel SNN accelerator that accumulates the membrane potential for all time steps in parallel. SATO architecture contains a novel binary adder-search tree to generate the output spike train, which decouples the chronological dependence in the accumulation-comparison operation. Moreover, SATO can evenly dispatch the compressed workloads to all PEs with maximized data locality of input spike trains based on a bucket-sort-based method. Our evaluations show that SATO outperforms the previous ANN accelerator 8-bit version of \"Eyeriss\" by 30.9× in terms of speedup and 12.3×, in terms of energy-saving. Compared with the state-of-the-art SNN accelerator \"SpinalFlow\", SATO can also achieve 6.4× performance gain and 4.8× energy reduction, which is quite impressive for inference.","PeriodicalId":373005,"journal":{"name":"Proceedings of the 59th ACM/IEEE Design Automation Conference","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 59th ACM/IEEE Design Automation Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3489517.3530592","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Event-driven spiking neural networks (SNNs) have shown great promise for being strikingly energy-efficient. SNN neurons integrate the spikes, accumulate the membrane potential, and fire output spike when the potential exceeds a threshold. Existing SNN accelerators, however, have to carry out such accumulation-comparison operation in serial. Repetitive spike generation at each time step not only increases latency as well as overall energy budget, but also incurs memory access overhead of fetching membrane potentials, both of which lessen the efficiency of SNN accelerators. Meanwhile, inherent highly sparse spikes of SNNs lead to imbalanced workloads among neurons that hurdle the utilization of processing elements (PEs). This paper proposes SATO, a temporal-parallel SNN accelerator that accumulates the membrane potential for all time steps in parallel. SATO architecture contains a novel binary adder-search tree to generate the output spike train, which decouples the chronological dependence in the accumulation-comparison operation. Moreover, SATO can evenly dispatch the compressed workloads to all PEs with maximized data locality of input spike trains based on a bucket-sort-based method. Our evaluations show that SATO outperforms the previous ANN accelerator 8-bit version of "Eyeriss" by 30.9× in terms of speedup and 12.3×, in terms of energy-saving. Compared with the state-of-the-art SNN accelerator "SpinalFlow", SATO can also achieve 6.4× performance gain and 4.8× energy reduction, which is quite impressive for inference.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

SATO:通过面向时间的数据流和架构来加速神经网络

事件驱动的峰值神经网络(snn)已经显示出惊人的节能前景。SNN神经元整合这些峰，积累膜电位，并在电位超过阈值时发出输出峰。然而，现有的SNN加速器必须串行地进行这种累加比较操作。在每个时间步重复产生尖峰不仅增加了延迟和总能量收支，而且还增加了获取膜电位的存储器访问开销，这两者都降低了SNN加速器的效率。同时，snn固有的高度稀疏峰值导致神经元之间的工作负载不平衡，阻碍了处理元素(PEs)的利用。本文提出了一种时间平行SNN加速器SATO，它可以平行地积累所有时间步长的膜电位。SATO架构包含一种新颖的二加法器搜索树来生成输出尖峰序列，从而解耦了累加比较操作中的时间依赖性。此外，SATO还可以基于桶排序的方法，将压缩后的工作负载均匀地分配到具有最大输入尖峰序列数据局部性的所有pe上。我们的评估表明，SATO比之前的ANN加速器8位版本的“Eyeriss”在加速方面提高了30.9倍，在节能方面提高了12.3倍。与最先进的SNN加速器“SpinalFlow”相比，SATO还可以实现6.4倍的性能增益和4.8倍的能量降低，这对于推理来说是相当令人印象深刻的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 59th ACM/IEEE Design Automation Conference

自引率

0.00%

发文量

期刊最新文献

Timing macro modeling with graph neural networks Thermal-aware optical-electrical routing codesign for on-chip signal communications PHANES ScaleHLS Terminator on SkyNet: a practical DVFS attack on DNN hardware IP for UAV object detection