Maximilian Baronig, Romain Ferrand, Silvester Sabathiel, Robert Legenstein
{"title":"Advancing Spatio-Temporal Processing in Spiking Neural Networks through Adaptation","authors":"Maximilian Baronig, Romain Ferrand, Silvester Sabathiel, Robert Legenstein","doi":"arxiv-2408.07517","DOIUrl":null,"url":null,"abstract":"Efficient implementations of spiking neural networks on neuromorphic hardware\npromise orders of magnitude less power consumption than their non-spiking\ncounterparts. The standard neuron model for spike-based computation on such\nneuromorphic systems has long been the leaky integrate-and-fire (LIF) neuron.\nAs a promising advancement, a computationally light augmentation of the LIF\nneuron model with an adaptation mechanism experienced a recent upswing in\npopularity, caused by demonstrations of its superior performance on\nspatio-temporal processing tasks. The root of the superiority of these\nso-called adaptive LIF neurons however, is not well understood. In this\narticle, we thoroughly analyze the dynamical, computational, and learning\nproperties of adaptive LIF neurons and networks thereof. We find that the\nfrequently observed stability problems during training of such networks can be\novercome by applying an alternative discretization method that results in\nprovably better stability properties than the commonly used Euler-Forward\nmethod. With this discretization, we achieved a new state-of-the-art\nperformance on common event-based benchmark datasets. We also show that the\nsuperiority of networks of adaptive LIF neurons extends to the prediction and\ngeneration of complex time series. Our further analysis of the computational\nproperties of networks of adaptive LIF neurons shows that they are particularly\nwell suited to exploit the spatio-temporal structure of input sequences.\nFurthermore, these networks are surprisingly robust to shifts of the mean input\nstrength and input spike rate, even when these shifts were not observed during\ntraining. As a consequence, high-performance networks can be obtained without\nany normalization techniques such as batch normalization or batch-normalization\nthrough time.","PeriodicalId":501347,"journal":{"name":"arXiv - CS - Neural and Evolutionary Computing","volume":"44 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Neural and Evolutionary Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.07517","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Efficient implementations of spiking neural networks on neuromorphic hardware
promise orders of magnitude less power consumption than their non-spiking
counterparts. The standard neuron model for spike-based computation on such
neuromorphic systems has long been the leaky integrate-and-fire (LIF) neuron.
As a promising advancement, a computationally light augmentation of the LIF
neuron model with an adaptation mechanism experienced a recent upswing in
popularity, caused by demonstrations of its superior performance on
spatio-temporal processing tasks. The root of the superiority of these
so-called adaptive LIF neurons however, is not well understood. In this
article, we thoroughly analyze the dynamical, computational, and learning
properties of adaptive LIF neurons and networks thereof. We find that the
frequently observed stability problems during training of such networks can be
overcome by applying an alternative discretization method that results in
provably better stability properties than the commonly used Euler-Forward
method. With this discretization, we achieved a new state-of-the-art
performance on common event-based benchmark datasets. We also show that the
superiority of networks of adaptive LIF neurons extends to the prediction and
generation of complex time series. Our further analysis of the computational
properties of networks of adaptive LIF neurons shows that they are particularly
well suited to exploit the spatio-temporal structure of input sequences.
Furthermore, these networks are surprisingly robust to shifts of the mean input
strength and input spike rate, even when these shifts were not observed during
training. As a consequence, high-performance networks can be obtained without
any normalization techniques such as batch normalization or batch-normalization
through time.