Training SNNs Low Latency Utilizing Batch Normalization Through Time and Iterative Initialization Retraining

Thi Diem Tran, Huu-Hanh Hoang
{"title":"Training SNNs Low Latency Utilizing Batch Normalization Through Time and Iterative Initialization Retraining","authors":"Thi Diem Tran, Huu-Hanh Hoang","doi":"10.1109/ICAIIC57133.2023.10067096","DOIUrl":null,"url":null,"abstract":"Spiking Neural Network (SNN), developing on neuromorphic hardware, is a promising energy-efficient AI paradigm. However, processing over several timesteps reduces the energy benefits of SNNs due to high latency, the number of operations, and memory access costs from acquiring membrane potentials. Furthermore, their non-derivative nature makes SNNs difficult to train properly. To overcome these issues and leverage the full potential of SNNs, in this research, we offer a novel way for training deep SNNs utilizing Batch Normalization Through Time and Iterative Initialization and Retraining techniques. First, the BNTT improves low-latency and low-energy training in SNNs by allowing neurons to handle the spike rate over many timesteps. Second, we can obtain SNNs with up to unit latency pass during inference when applying the Iterative Initialization and Retraining technique during training SNNs. On the CIFAR-10, CIFAR-100, and ImageNet, we achieve cutting-edge SNN performance using a deep neural network with just one timestep. We achieve top-1 accuracy of 91.01%, 71.88%, and 69.8% on CIFAR-10, CIFAR-100, and ImageNet, respectively, using the VGG 16 architecture.","PeriodicalId":105769,"journal":{"name":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIIC57133.2023.10067096","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Spiking Neural Network (SNN), developing on neuromorphic hardware, is a promising energy-efficient AI paradigm. However, processing over several timesteps reduces the energy benefits of SNNs due to high latency, the number of operations, and memory access costs from acquiring membrane potentials. Furthermore, their non-derivative nature makes SNNs difficult to train properly. To overcome these issues and leverage the full potential of SNNs, in this research, we offer a novel way for training deep SNNs utilizing Batch Normalization Through Time and Iterative Initialization and Retraining techniques. First, the BNTT improves low-latency and low-energy training in SNNs by allowing neurons to handle the spike rate over many timesteps. Second, we can obtain SNNs with up to unit latency pass during inference when applying the Iterative Initialization and Retraining technique during training SNNs. On the CIFAR-10, CIFAR-100, and ImageNet, we achieve cutting-edge SNN performance using a deep neural network with just one timestep. We achieve top-1 accuracy of 91.01%, 71.88%, and 69.8% on CIFAR-10, CIFAR-100, and ImageNet, respectively, using the VGG 16 architecture.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用时间批处理归一化和迭代初始化再训练来训练低延迟snn
脉冲神经网络(SNN)是在神经形态硬件上发展起来的一种有前途的高效节能人工智能范式。然而,由于高延迟、操作次数和获取膜电位的存储访问成本,多个时间步长的处理降低了snn的能量效益。此外,它们的非导数性质使得snn难以正确训练。为了克服这些问题并充分利用snn的潜力,在本研究中,我们提供了一种利用时间批处理归一化和迭代初始化和再训练技术来训练深度snn的新方法。首先,BNTT通过允许神经元处理多个时间步长的峰值速率,改善了snn的低延迟和低能量训练。其次,在训练snn时应用迭代初始化和再训练技术,我们可以在推理过程中获得具有最多单位延迟的snn。在CIFAR-10、CIFAR-100和ImageNet上,我们仅使用一个时间步长的深度神经网络实现了尖端的SNN性能。使用VGG - 16架构,我们在CIFAR-10、CIFAR-100和ImageNet上分别实现了91.01%、71.88%和69.8%的前一准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Development of AI Educational Datasets Library Using Synthetic Dataset Generation Method Channel Access Control Instead of Random Backoff Algorithm Illegal 3D Content Distribution Tracking System based on DNN Forensic Watermarking Deep Learning-based Spectral Efficiency Maximization in Massive MIMO-NOMA Systems with STAR-RIS Data Pipeline Design for Dangerous Driving Behavior Detection System
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1