Performance Evaluation of Spintronic-Based Spiking Neural Networks Using Parallel Discrete-Event Simulation

IF 0.7 4区计算机科学 Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS ACM Transactions on Modeling and Computer Simulation Pub Date : 2024-03-05 DOI:10.1145/3649464

Elkin Cruz-Camacho, Siyuan Qian, Ankit Shukla, Neil McGlohon, Shaloo Rakheja, Christopher D. Carothers

{"title":"Performance Evaluation of Spintronic-Based Spiking Neural Networks Using Parallel Discrete-Event Simulation","authors":"Elkin Cruz-Camacho, Siyuan Qian, Ankit Shukla, Neil McGlohon, Shaloo Rakheja, Christopher D. Carothers","doi":"10.1145/3649464","DOIUrl":null,"url":null,"abstract":"Spintronics devices that use the spin of electrons as the information state variable have the potential to emulate neuro-synaptic dynamics and can be realized within a compact form-factor, while operating at ultra-low energy-delay point. In this paper, we benchmark the performance of a spintronics hardware platform designed for handling neuromorphic tasks. To explore the benefits of spintronics-based hardware on realistic neuromorphic workloads, we developed a Parallel Discrete-Event Simulation model called Doryta, which is further integrated with a materials-to-systems benchmarking framework. The benchmarking framework allows us to obtain quantitative metrics on the throughput and energy of spintronics-based neuromorphic computing and compare these against standard CMOS-based approaches. Although spintronics hardware offers significant energy and latency advantages, we find that for larger neuromorphic circuits, the performance is limited by the interconnection networks rather than the spintronics-based neurons and synapses. This limitation can be overcome by architectural changes to the network. Through Doryta we are also able to show the power of neuromorphic computing by simulating Conway’s Game of Life (GoL), thus showing that it is Turing complete. We show that Doryta obtains over 300 × speedup using 1,024 CPU cores when tested on a convolutional, sparse, neural architecture. When scaled-up 64 times, to a 200 million neuron model, the simulation ran in 3:42 minutes for a total of 2000 virtual clock steps. The conservative approach of execution was found to be faster in most cases than the optimistic approach, even when a tie-breaking mechanism to guarantee deterministic execution, was deactivated.","PeriodicalId":50943,"journal":{"name":"ACM Transactions on Modeling and Computer Simulation","volume":"99 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2024-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Modeling and Computer Simulation","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1145/3649464","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Spintronics devices that use the spin of electrons as the information state variable have the potential to emulate neuro-synaptic dynamics and can be realized within a compact form-factor, while operating at ultra-low energy-delay point. In this paper, we benchmark the performance of a spintronics hardware platform designed for handling neuromorphic tasks.

To explore the benefits of spintronics-based hardware on realistic neuromorphic workloads, we developed a Parallel Discrete-Event Simulation model called Doryta, which is further integrated with a materials-to-systems benchmarking framework. The benchmarking framework allows us to obtain quantitative metrics on the throughput and energy of spintronics-based neuromorphic computing and compare these against standard CMOS-based approaches. Although spintronics hardware offers significant energy and latency advantages, we find that for larger neuromorphic circuits, the performance is limited by the interconnection networks rather than the spintronics-based neurons and synapses. This limitation can be overcome by architectural changes to the network.

Through Doryta we are also able to show the power of neuromorphic computing by simulating Conway’s Game of Life (GoL), thus showing that it is Turing complete. We show that Doryta obtains over 300 × speedup using 1,024 CPU cores when tested on a convolutional, sparse, neural architecture. When scaled-up 64 times, to a 200 million neuron model, the simulation ran in 3:42 minutes for a total of 2000 virtual clock steps. The conservative approach of execution was found to be faster in most cases than the optimistic approach, even when a tie-breaking mechanism to guarantee deterministic execution, was deactivated.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用并行离散事件仿真评估基于自旋电子的尖峰神经网络的性能

利用电子自旋作为信息状态变量的自旋电子器件具有模拟神经突触动力学的潜力，可以在紧凑的外形尺寸内实现，同时以超低能量延迟点运行。在本文中，我们对专为处理神经形态任务而设计的自旋电子硬件平台的性能进行了基准测试。为了探索基于自旋电子学的硬件在现实神经形态工作负载中的优势，我们开发了一个名为 Doryta 的并行离散事件仿真模型，并将其与材料到系统基准测试框架进一步整合。通过该基准测试框架，我们可以获得基于自旋电子学的神经形态计算的吞吐量和能耗的量化指标，并将其与基于 CMOS 的标准方法进行比较。虽然自旋电子硬件在能量和延迟方面具有显著优势，但我们发现，对于较大的神经形态电路，其性能受限于互连网络，而不是基于自旋电子的神经元和突触。这种限制可以通过改变网络结构来克服。通过 Doryta，我们还能模拟康威的生命游戏（GoL），展示神经形态计算的威力，从而证明它是图灵完备的。我们的研究表明，在卷积、稀疏神经架构上进行测试时，Doryta 使用 1,024 个 CPU 内核的速度提高了 300 倍以上。如果将其放大 64 倍，即 2 亿个神经元模型，模拟运行时间为 3:42 分钟，虚拟时钟步数为 2000 步。我们发现，在大多数情况下，保守的执行方法比乐观的执行方法更快，即使在保证确定性执行的决胜机制被停用的情况下也是如此。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ACM Transactions on Modeling and Computer Simulation 工程技术-计算机：跨学科应用

CiteScore

2.50

自引率

22.20%

发文量

审稿时长

>12 weeks

期刊介绍： The ACM Transactions on Modeling and Computer Simulation (TOMACS) provides a single archival source for the publication of high-quality research and developmental results referring to all phases of the modeling and simulation life cycle. The subjects of emphasis are discrete event simulation, combined discrete and continuous simulation, as well as Monte Carlo methods. The use of simulation techniques is pervasive, extending to virtually all the sciences. TOMACS serves to enhance the understanding, improve the practice, and increase the utilization of computer simulation. Submissions should contribute to the realization of these objectives, and papers treating applications should stress their contributions vis-á-vis these objectives.