Chien-Yao Wang, Min Shih, Tzu-Chiang Tai, Po-Chuan Lin, Shih-Ting Huang, Jia-Hao Zhao, Jia-Ching Wang
{"title":"VLSI design for SC-based speaker recognition","authors":"Chien-Yao Wang, Min Shih, Tzu-Chiang Tai, Po-Chuan Lin, Shih-Ting Huang, Jia-Hao Zhao, Jia-Ching Wang","doi":"10.1109/ICIEA.2015.7334135","DOIUrl":null,"url":null,"abstract":"This work presents an efficient VLSI architecture design for sparse coding (SC)-based speaker recognition system. The proposed system first extracts the linear predictive cepstral coefficients (LPCCs). Then, we applied orthogonal matching pursuit (OMP) for sparse coding and using the sparse coefficients as feature to do classification task. To speed up the computation time, our proposed chip comprises a LPCC module and an OMP module. The LPCC module computes the linear predictive coefficients (LPCs) and then converts LPCs to LPCCs. The OMP module includes residual unit, atom selection unit, QR decomposition unit, triangular matrix inverse unit and matrix multiplication unit. This designed chip has ability to handle a large dictionary size for sparse coding in OMP modules. The prototype chip is implemented using TSMC 90 nm CMOS technology on a die with a size of approximately 1.9×1.9 mm2.","PeriodicalId":270660,"journal":{"name":"2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA)","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIEA.2015.7334135","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
This work presents an efficient VLSI architecture design for sparse coding (SC)-based speaker recognition system. The proposed system first extracts the linear predictive cepstral coefficients (LPCCs). Then, we applied orthogonal matching pursuit (OMP) for sparse coding and using the sparse coefficients as feature to do classification task. To speed up the computation time, our proposed chip comprises a LPCC module and an OMP module. The LPCC module computes the linear predictive coefficients (LPCs) and then converts LPCs to LPCCs. The OMP module includes residual unit, atom selection unit, QR decomposition unit, triangular matrix inverse unit and matrix multiplication unit. This designed chip has ability to handle a large dictionary size for sparse coding in OMP modules. The prototype chip is implemented using TSMC 90 nm CMOS technology on a die with a size of approximately 1.9×1.9 mm2.