利用自适应局部竞争算法进行高效稀疏编码以实现语音分类

Soufiyan Bahadi, Eric Plourde, Jean Rouat
{"title":"利用自适应局部竞争算法进行高效稀疏编码以实现语音分类","authors":"Soufiyan Bahadi, Eric Plourde, Jean Rouat","doi":"arxiv-2409.08188","DOIUrl":null,"url":null,"abstract":"Researchers are exploring novel computational paradigms such as sparse coding\nand neuromorphic computing to bridge the efficiency gap between the human brain\nand conventional computers in complex tasks. A key area of focus is\nneuromorphic audio processing. While the Locally Competitive Algorithm has\nemerged as a promising solution for sparse coding, offering potential for\nreal-time and low-power processing on neuromorphic hardware, its applications\nin neuromorphic speech classification have not been thoroughly studied. The\nAdaptive Locally Competitive Algorithm builds upon the Locally Competitive\nAlgorithm by dynamically adjusting the modulation parameters of the filter bank\nto fine-tune the filters' sensitivity. This adaptability enhances lateral\ninhibition, improving reconstruction quality, sparsity, and convergence time,\nwhich is crucial for real-time applications. This paper demonstrates the\npotential of the Locally Competitive Algorithm and its adaptive variant as\nrobust feature extractors for neuromorphic speech classification. Results show\nthat the Locally Competitive Algorithm achieves better speech classification\naccuracy at the expense of higher power consumption compared to the LAUSCHER\ncochlea model used for benchmarking. On the other hand, the Adaptive Locally\nCompetitive Algorithm mitigates this power consumption issue without\ncompromising the accuracy. The dynamic power consumption is reduced to a range\nof 0.004 to 13 milliwatts on neuromorphic hardware, three orders of magnitude\nless than setups using Graphics Processing Units. These findings position the\nAdaptive Locally Competitive Algorithm as a compelling solution for efficient\nspeech classification systems, promising substantial advancements in balancing\nspeech classification accuracy and power efficiency.","PeriodicalId":501284,"journal":{"name":"arXiv - EE - Audio and Speech Processing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient Sparse Coding with the Adaptive Locally Competitive Algorithm for Speech Classification\",\"authors\":\"Soufiyan Bahadi, Eric Plourde, Jean Rouat\",\"doi\":\"arxiv-2409.08188\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Researchers are exploring novel computational paradigms such as sparse coding\\nand neuromorphic computing to bridge the efficiency gap between the human brain\\nand conventional computers in complex tasks. A key area of focus is\\nneuromorphic audio processing. While the Locally Competitive Algorithm has\\nemerged as a promising solution for sparse coding, offering potential for\\nreal-time and low-power processing on neuromorphic hardware, its applications\\nin neuromorphic speech classification have not been thoroughly studied. The\\nAdaptive Locally Competitive Algorithm builds upon the Locally Competitive\\nAlgorithm by dynamically adjusting the modulation parameters of the filter bank\\nto fine-tune the filters' sensitivity. This adaptability enhances lateral\\ninhibition, improving reconstruction quality, sparsity, and convergence time,\\nwhich is crucial for real-time applications. This paper demonstrates the\\npotential of the Locally Competitive Algorithm and its adaptive variant as\\nrobust feature extractors for neuromorphic speech classification. Results show\\nthat the Locally Competitive Algorithm achieves better speech classification\\naccuracy at the expense of higher power consumption compared to the LAUSCHER\\ncochlea model used for benchmarking. On the other hand, the Adaptive Locally\\nCompetitive Algorithm mitigates this power consumption issue without\\ncompromising the accuracy. The dynamic power consumption is reduced to a range\\nof 0.004 to 13 milliwatts on neuromorphic hardware, three orders of magnitude\\nless than setups using Graphics Processing Units. These findings position the\\nAdaptive Locally Competitive Algorithm as a compelling solution for efficient\\nspeech classification systems, promising substantial advancements in balancing\\nspeech classification accuracy and power efficiency.\",\"PeriodicalId\":501284,\"journal\":{\"name\":\"arXiv - EE - Audio and Speech Processing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - EE - Audio and Speech Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.08188\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Audio and Speech Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

研究人员正在探索稀疏编码和神经形态计算等新型计算范式,以缩小人脑与传统计算机在复杂任务中的效率差距。其中一个重点领域是神经形态音频处理。虽然局部竞争算法(Locally Competitive Algorithm)已成为稀疏编码的一种有前途的解决方案,为神经形态硬件上的实时和低功耗处理提供了潜力,但其在神经形态语音分类中的应用尚未得到深入研究。自适应局部竞争算法以局部竞争算法为基础,通过动态调整滤波器组的调制参数来微调滤波器的灵敏度。这种适应性增强了后期抑制,提高了重建质量、稀疏性和收敛时间,这对实时应用至关重要。本文展示了局部竞争算法及其自适应变体作为神经形态语音分类的稳健特征提取器的潜力。结果表明,与用于基准测试的 LAUSCHER 耳蜗模型相比,局部竞争算法以更高的功耗为代价,获得了更好的语音分类准确性。另一方面,自适应局部竞争算法在不影响准确性的情况下缓解了功耗问题。在神经形态硬件上,动态功耗降低到 0.004 到 13 毫瓦之间,比使用图形处理器的设置低三个数量级。这些发现将自适应局部竞争算法定位为高效语音分类系统的一个引人注目的解决方案,有望在平衡语音分类准确性和能效方面取得重大进展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Efficient Sparse Coding with the Adaptive Locally Competitive Algorithm for Speech Classification
Researchers are exploring novel computational paradigms such as sparse coding and neuromorphic computing to bridge the efficiency gap between the human brain and conventional computers in complex tasks. A key area of focus is neuromorphic audio processing. While the Locally Competitive Algorithm has emerged as a promising solution for sparse coding, offering potential for real-time and low-power processing on neuromorphic hardware, its applications in neuromorphic speech classification have not been thoroughly studied. The Adaptive Locally Competitive Algorithm builds upon the Locally Competitive Algorithm by dynamically adjusting the modulation parameters of the filter bank to fine-tune the filters' sensitivity. This adaptability enhances lateral inhibition, improving reconstruction quality, sparsity, and convergence time, which is crucial for real-time applications. This paper demonstrates the potential of the Locally Competitive Algorithm and its adaptive variant as robust feature extractors for neuromorphic speech classification. Results show that the Locally Competitive Algorithm achieves better speech classification accuracy at the expense of higher power consumption compared to the LAUSCHER cochlea model used for benchmarking. On the other hand, the Adaptive Locally Competitive Algorithm mitigates this power consumption issue without compromising the accuracy. The dynamic power consumption is reduced to a range of 0.004 to 13 milliwatts on neuromorphic hardware, three orders of magnitude less than setups using Graphics Processing Units. These findings position the Adaptive Locally Competitive Algorithm as a compelling solution for efficient speech classification systems, promising substantial advancements in balancing speech classification accuracy and power efficiency.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Exploring an Inter-Pausal Unit (IPU) based Approach for Indic End-to-End TTS Systems Conformal Prediction for Manifold-based Source Localization with Gaussian Processes Insights into the Incorporation of Signal Information in Binaural Signal Matching with Wearable Microphone Arrays Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight Speech Enhancement Low Frame-rate Speech Codec: a Codec Designed for Fast High-quality Speech LLM Training and Inference
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1