{"title":"A Sub-400-nW Real-Time Event-Driven Spectrogram Extraction Unit in 28-nm FD-SOI CMOS for Keyword Spotting Application","authors":"Soufiane Mourrane;Benoit Larras;Sylvain Clerc;Andreia Cathelin;Antoine Frappé","doi":"10.1109/JSSC.2024.3475948","DOIUrl":null,"url":null,"abstract":"Considering the power-hungry nature of speech processing, a keyword spotting (KWS) unit, used to detect multiple spoken words, is often integrated as a front-end layer. KWS systems are always active, and thus, it is extremely important to optimize the devoted power budget. In this context, this article presents a programmable low-power event-driven real-time spectrogram extraction unit tested for the KWS application. This chip, fabricated in 28-nm FD-SOI CMOS technology, has been combined with a software-defined convolutional neural network to demonstrate the recognition of 11 audio classes (ten keywords + background + unknown) with an accuracy equal to 87.9% and an activity-dependent power consumption measured at 391.6 nW, for a 12-keyword/min average speech rate.","PeriodicalId":13129,"journal":{"name":"IEEE Journal of Solid-state Circuits","volume":"60 6","pages":"2060-2071"},"PeriodicalIF":5.6000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Solid-state Circuits","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10748395/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Considering the power-hungry nature of speech processing, a keyword spotting (KWS) unit, used to detect multiple spoken words, is often integrated as a front-end layer. KWS systems are always active, and thus, it is extremely important to optimize the devoted power budget. In this context, this article presents a programmable low-power event-driven real-time spectrogram extraction unit tested for the KWS application. This chip, fabricated in 28-nm FD-SOI CMOS technology, has been combined with a software-defined convolutional neural network to demonstrate the recognition of 11 audio classes (ten keywords + background + unknown) with an accuracy equal to 87.9% and an activity-dependent power consumption measured at 391.6 nW, for a 12-keyword/min average speech rate.
期刊介绍:
The IEEE Journal of Solid-State Circuits publishes papers each month in the broad area of solid-state circuits with particular emphasis on transistor-level design of integrated circuits. It also provides coverage of topics such as circuits modeling, technology, systems design, layout, and testing that relate directly to IC design. Integrated circuits and VLSI are of principal interest; material related to discrete circuit design is seldom published. Experimental verification is strongly encouraged.