{"title":"FPGA Implementation of Support Vector Machine Based Isolated Digit Recognition System","authors":"J. Manikandan, B. Venkataramani, V. Avanthi","doi":"10.1109/VLSI.Design.2009.23","DOIUrl":null,"url":null,"abstract":"In this paper, two schemes for FPGA implementation of multi-class SVM based isolated digit recognition system are proposed, one using only logic elements and another using both soft-core processor and logic elements(LEs). One of the major contributions of this paper is the proposal for implementation of the decision function using only fixed point arithmetic without compromising the recognition accuracy. Compared to the scheme which uses floating point arithmetic, the proposed scheme reduces the number of LEs required by a factor of 3.29. The second scheme proposed results in about 25 times lower area compared to the first scheme. For the soft-core processor approach, a custom instruction is proposed for floating point arithmetic. Speaker dependent TI46 database of isolated digits is used for training and testing. Features are extracted using both Linear Predictive Coefficients (LPC) and Mel Frequency Cepstral Coefficients(MFCC) and features are compressed using Self Organized Feature Mapping (SOFM). This in turn is used by the SVM classifier to evaluate the recognition accuracy and the hardware resources utilized. Both the schemes proposed result in 100% recognition accuracy when implemented on Altera Cyclone II FPGA. The proposed schemes can also be used for speaker verification and speaker authentication applications. Since the scheme which uses soft-core processor requires lower area, it can be used for systems which require a large vocabulary size.","PeriodicalId":267121,"journal":{"name":"2009 22nd International Conference on VLSI Design","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-01-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"35","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 22nd International Conference on VLSI Design","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VLSI.Design.2009.23","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 35
Abstract
In this paper, two schemes for FPGA implementation of multi-class SVM based isolated digit recognition system are proposed, one using only logic elements and another using both soft-core processor and logic elements(LEs). One of the major contributions of this paper is the proposal for implementation of the decision function using only fixed point arithmetic without compromising the recognition accuracy. Compared to the scheme which uses floating point arithmetic, the proposed scheme reduces the number of LEs required by a factor of 3.29. The second scheme proposed results in about 25 times lower area compared to the first scheme. For the soft-core processor approach, a custom instruction is proposed for floating point arithmetic. Speaker dependent TI46 database of isolated digits is used for training and testing. Features are extracted using both Linear Predictive Coefficients (LPC) and Mel Frequency Cepstral Coefficients(MFCC) and features are compressed using Self Organized Feature Mapping (SOFM). This in turn is used by the SVM classifier to evaluate the recognition accuracy and the hardware resources utilized. Both the schemes proposed result in 100% recognition accuracy when implemented on Altera Cyclone II FPGA. The proposed schemes can also be used for speaker verification and speaker authentication applications. Since the scheme which uses soft-core processor requires lower area, it can be used for systems which require a large vocabulary size.
本文提出了两种FPGA实现基于多类支持向量机的隔离数字识别系统的方案,一种方案仅使用逻辑元件,另一种方案同时使用软核处理器和逻辑元件。本文的主要贡献之一是在不影响识别精度的情况下,仅使用不动点算法实现决策函数。与使用浮点算法的方案相比,所提出的方案将所需的le数量减少了3.29倍。与第一方案相比,第二方案的面积减少了约25倍。对于软核处理器方法,提出了一个自定义的浮点运算指令。与说话人相关的TI46孤立数字数据库用于训练和测试。使用线性预测系数(LPC)和Mel频率倒谱系数(MFCC)提取特征,并使用自组织特征映射(SOFM)压缩特征。支持向量机分类器利用这一数据来评估识别精度和所使用的硬件资源。在Altera Cyclone II FPGA上实现后,两种方案的识别准确率均达到100%。所提出的方案也可用于说话人验证和说话人身份验证应用。由于采用软核处理器的方案占用的空间较小,因此可用于对词汇量要求较大的系统。