利用自适应局部竞争算法进行高效稀疏编码以实现语音分类

arXiv - EE - Audio and Speech Processing Pub Date : 2024-09-12 DOI:arxiv-2409.08188

Soufiyan Bahadi, Eric Plourde, Jean Rouat

{"title":"利用自适应局部竞争算法进行高效稀疏编码以实现语音分类","authors":"Soufiyan Bahadi, Eric Plourde, Jean Rouat","doi":"arxiv-2409.08188","DOIUrl":null,"url":null,"abstract":"Researchers are exploring novel computational paradigms such as sparse coding\nand neuromorphic computing to bridge the efficiency gap between the human brain\nand conventional computers in complex tasks. A key area of focus is\nneuromorphic audio processing. While the Locally Competitive Algorithm has\nemerged as a promising solution for sparse coding, offering potential for\nreal-time and low-power processing on neuromorphic hardware, its applications\nin neuromorphic speech classification have not been thoroughly studied. The\nAdaptive Locally Competitive Algorithm builds upon the Locally Competitive\nAlgorithm by dynamically adjusting the modulation parameters of the filter bank\nto fine-tune the filters' sensitivity. This adaptability enhances lateral\ninhibition, improving reconstruction quality, sparsity, and convergence time,\nwhich is crucial for real-time applications. This paper demonstrates the\npotential of the Locally Competitive Algorithm and its adaptive variant as\nrobust feature extractors for neuromorphic speech classification. Results show\nthat the Locally Competitive Algorithm achieves better speech classification\naccuracy at the expense of higher power consumption compared to the LAUSCHER\ncochlea model used for benchmarking. On the other hand, the Adaptive Locally\nCompetitive Algorithm mitigates this power consumption issue without\ncompromising the accuracy. The dynamic power consumption is reduced to a range\nof 0.004 to 13 milliwatts on neuromorphic hardware, three orders of magnitude\nless than setups using Graphics Processing Units. These findings position the\nAdaptive Locally Competitive Algorithm as a compelling solution for efficient\nspeech classification systems, promising substantial advancements in balancing\nspeech classification accuracy and power efficiency.","PeriodicalId":501284,"journal":{"name":"arXiv - EE - Audio and Speech Processing","volume":"4 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient Sparse Coding with the Adaptive Locally Competitive Algorithm for Speech Classification\",\"authors\":\"Soufiyan Bahadi, Eric Plourde, Jean Rouat\",\"doi\":\"arxiv-2409.08188\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Researchers are exploring novel computational paradigms such as sparse coding\\nand neuromorphic computing to bridge the efficiency gap between the human brain\\nand conventional computers in complex tasks. A key area of focus is\\nneuromorphic audio processing. While the Locally Competitive Algorithm has\\nemerged as a promising solution for sparse coding, offering potential for\\nreal-time and low-power processing on neuromorphic hardware, its applications\\nin neuromorphic speech classification have not been thoroughly studied. The\\nAdaptive Locally Competitive Algorithm builds upon the Locally Competitive\\nAlgorithm by dynamically adjusting the modulation parameters of the filter bank\\nto fine-tune the filters' sensitivity. This adaptability enhances lateral\\ninhibition, improving reconstruction quality, sparsity, and convergence time,\\nwhich is crucial for real-time applications. This paper demonstrates the\\npotential of the Locally Competitive Algorithm and its adaptive variant as\\nrobust feature extractors for neuromorphic speech classification. Results show\\nthat the Locally Competitive Algorithm achieves better speech classification\\naccuracy at the expense of higher power consumption compared to the LAUSCHER\\ncochlea model used for benchmarking. On the other hand, the Adaptive Locally\\nCompetitive Algorithm mitigates this power consumption issue without\\ncompromising the accuracy. The dynamic power consumption is reduced to a range\\nof 0.004 to 13 milliwatts on neuromorphic hardware, three orders of magnitude\\nless than setups using Graphics Processing Units. These findings position the\\nAdaptive Locally Competitive Algorithm as a compelling solution for efficient\\nspeech classification systems, promising substantial advancements in balancing\\nspeech classification accuracy and power efficiency.\",\"PeriodicalId\":501284,\"journal\":{\"name\":\"arXiv - EE - Audio and Speech Processing\",\"volume\":\"4 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - EE - Audio and Speech Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.08188\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - EE - Audio and Speech Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08188","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

研究人员正在探索稀疏编码和神经形态计算等新型计算范式，以缩小人脑与传统计算机在复杂任务中的效率差距。其中一个重点领域是神经形态音频处理。虽然局部竞争算法（Locally Competitive Algorithm）已成为稀疏编码的一种有前途的解决方案，为神经形态硬件上的实时和低功耗处理提供了潜力，但其在神经形态语音分类中的应用尚未得到深入研究。自适应局部竞争算法以局部竞争算法为基础，通过动态调整滤波器组的调制参数来微调滤波器的灵敏度。这种适应性增强了后期抑制，提高了重建质量、稀疏性和收敛时间，这对实时应用至关重要。本文展示了局部竞争算法及其自适应变体作为神经形态语音分类的稳健特征提取器的潜力。结果表明，与用于基准测试的 LAUSCHER 耳蜗模型相比，局部竞争算法以更高的功耗为代价，获得了更好的语音分类准确性。另一方面，自适应局部竞争算法在不影响准确性的情况下缓解了功耗问题。在神经形态硬件上，动态功耗降低到 0.004 到 13 毫瓦之间，比使用图形处理器的设置低三个数量级。这些发现将自适应局部竞争算法定位为高效语音分类系统的一个引人注目的解决方案，有望在平衡语音分类准确性和能效方面取得重大进展。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Efficient Sparse Coding with the Adaptive Locally Competitive Algorithm for Speech Classification

Researchers are exploring novel computational paradigms such as sparse coding and neuromorphic computing to bridge the efficiency gap between the human brain and conventional computers in complex tasks. A key area of focus is neuromorphic audio processing. While the Locally Competitive Algorithm has emerged as a promising solution for sparse coding, offering potential for real-time and low-power processing on neuromorphic hardware, its applications in neuromorphic speech classification have not been thoroughly studied. The Adaptive Locally Competitive Algorithm builds upon the Locally Competitive Algorithm by dynamically adjusting the modulation parameters of the filter bank to fine-tune the filters' sensitivity. This adaptability enhances lateral inhibition, improving reconstruction quality, sparsity, and convergence time, which is crucial for real-time applications. This paper demonstrates the potential of the Locally Competitive Algorithm and its adaptive variant as robust feature extractors for neuromorphic speech classification. Results show that the Locally Competitive Algorithm achieves better speech classification accuracy at the expense of higher power consumption compared to the LAUSCHER cochlea model used for benchmarking. On the other hand, the Adaptive Locally Competitive Algorithm mitigates this power consumption issue without compromising the accuracy. The dynamic power consumption is reduced to a range of 0.004 to 13 milliwatts on neuromorphic hardware, three orders of magnitude less than setups using Graphics Processing Units. These findings position the Adaptive Locally Competitive Algorithm as a compelling solution for efficient speech classification systems, promising substantial advancements in balancing speech classification accuracy and power efficiency.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

arXiv - EE - Audio and Speech Processing

自引率

0.00%

发文量