{"title":"Neuromorphic Keyword Spotting with Pulse Density Modulation MEMS Microphones","authors":"Sidi Yaya Arnaud Yarga, Sean U. N. Wood","doi":"arxiv-2408.05156","DOIUrl":null,"url":null,"abstract":"The Keyword Spotting (KWS) task involves continuous audio stream monitoring\nto detect predefined words, requiring low energy devices for continuous\nprocessing. Neuromorphic devices effectively address this energy challenge.\nHowever, the general neuromorphic KWS pipeline, from microphone to Spiking\nNeural Network (SNN), entails multiple processing stages. Leveraging the\npopularity of Pulse Density Modulation (PDM) microphones in modern devices and\ntheir similarity to spiking neurons, we propose a direct microphone-to-SNN\nconnection. This approach eliminates intermediate stages, notably reducing\ncomputational costs. The system achieved an accuracy of 91.54\\% on the Google\nSpeech Command (GSC) dataset, surpassing the state-of-the-art for the Spiking\nSpeech Command (SSC) dataset which is a bio-inspired encoded GSC. Furthermore,\nthe observed sparsity in network activity and connectivity indicates potential\nfor remarkably low energy consumption in a neuromorphic device implementation.","PeriodicalId":501347,"journal":{"name":"arXiv - CS - Neural and Evolutionary Computing","volume":"26 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Neural and Evolutionary Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.05156","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The Keyword Spotting (KWS) task involves continuous audio stream monitoring
to detect predefined words, requiring low energy devices for continuous
processing. Neuromorphic devices effectively address this energy challenge.
However, the general neuromorphic KWS pipeline, from microphone to Spiking
Neural Network (SNN), entails multiple processing stages. Leveraging the
popularity of Pulse Density Modulation (PDM) microphones in modern devices and
their similarity to spiking neurons, we propose a direct microphone-to-SNN
connection. This approach eliminates intermediate stages, notably reducing
computational costs. The system achieved an accuracy of 91.54\% on the Google
Speech Command (GSC) dataset, surpassing the state-of-the-art for the Spiking
Speech Command (SSC) dataset which is a bio-inspired encoded GSC. Furthermore,
the observed sparsity in network activity and connectivity indicates potential
for remarkably low energy consumption in a neuromorphic device implementation.