{"title":"基于频谱包络的马来语元音识别","authors":"Fadzilah Siraj, M. Azmi, M. Paulraj, S. Yaacob","doi":"10.1109/AMS.2009.152","DOIUrl":null,"url":null,"abstract":"Automatic speech recognition (ASR) has made great strides with the development of digital signal processing hardware and software especially using English as the language of choice. In this paper, a new feature extraction method is presented to identify vowels recorded from 80 Malaysian speakers. The features are obtained from Vocal Tract Model based on Bandwidth (BW) approach. The bandwidth is determined by finding the frequency where the spectral energy is 3dB below the peak. Average gain was calculated from these bandwidths. Classification results from Bandwidth Approach were then compared with results from 14 MFCC Coefficients using BPNN (Backpropagation Neural Network), MLR (Multinomial Logistic Regression) and LDA (Linear Discriminative Analysis). Classification accuracy obtained shows Bandwidth Approach performs better than MFCC using all these classifiers.","PeriodicalId":6461,"journal":{"name":"2009 Third Asia International Conference on Modelling & Simulation","volume":"60 1","pages":"363-368"},"PeriodicalIF":0.0000,"publicationDate":"2009-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Malaysian Vowel Recognition Based on Spectral Envelope Using Bandwidth Approach\",\"authors\":\"Fadzilah Siraj, M. Azmi, M. Paulraj, S. Yaacob\",\"doi\":\"10.1109/AMS.2009.152\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic speech recognition (ASR) has made great strides with the development of digital signal processing hardware and software especially using English as the language of choice. In this paper, a new feature extraction method is presented to identify vowels recorded from 80 Malaysian speakers. The features are obtained from Vocal Tract Model based on Bandwidth (BW) approach. The bandwidth is determined by finding the frequency where the spectral energy is 3dB below the peak. Average gain was calculated from these bandwidths. Classification results from Bandwidth Approach were then compared with results from 14 MFCC Coefficients using BPNN (Backpropagation Neural Network), MLR (Multinomial Logistic Regression) and LDA (Linear Discriminative Analysis). Classification accuracy obtained shows Bandwidth Approach performs better than MFCC using all these classifiers.\",\"PeriodicalId\":6461,\"journal\":{\"name\":\"2009 Third Asia International Conference on Modelling & Simulation\",\"volume\":\"60 1\",\"pages\":\"363-368\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-05-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 Third Asia International Conference on Modelling & Simulation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AMS.2009.152\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Third Asia International Conference on Modelling & Simulation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AMS.2009.152","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Malaysian Vowel Recognition Based on Spectral Envelope Using Bandwidth Approach
Automatic speech recognition (ASR) has made great strides with the development of digital signal processing hardware and software especially using English as the language of choice. In this paper, a new feature extraction method is presented to identify vowels recorded from 80 Malaysian speakers. The features are obtained from Vocal Tract Model based on Bandwidth (BW) approach. The bandwidth is determined by finding the frequency where the spectral energy is 3dB below the peak. Average gain was calculated from these bandwidths. Classification results from Bandwidth Approach were then compared with results from 14 MFCC Coefficients using BPNN (Backpropagation Neural Network), MLR (Multinomial Logistic Regression) and LDA (Linear Discriminative Analysis). Classification accuracy obtained shows Bandwidth Approach performs better than MFCC using all these classifiers.