Shibli Nisar, Ibrahim Shahzad, Muhammad Adnan Khan, Muhammad Tariq
{"title":"基于谱和韵律特征提取的普什图语语音数字识别","authors":"Shibli Nisar, Ibrahim Shahzad, Muhammad Adnan Khan, Muhammad Tariq","doi":"10.1109/ICACI.2017.7974488","DOIUrl":null,"url":null,"abstract":"Automatic spoken digit recognition is one of the important areas in speech recognition. Local language spoken digits recognition is the next stage in this technological advancement. This paper presents a new approach for Pashto digits recognition using spectral and prosodic based feature extraction. Very little or almost no work has been done in Pashto spoken digit recognition. Thats why no standard Pashto digit corpus was available online. A database of 150 native speakers from 0 (sefor) to 9 (naha) including 75 males and 75 females is developed. Creation of corpus is one of the main contributions of this work. To the best of authors knowledge support vector machine (SVM) is used for the first time in Pashto digits classification and compared with K-nearest neighbor (KNN) classifier. Out of 150, 110 speakers feature set are used for training purposes and the remaining 40 speakers feature set are used for testing. The results obtained from the proposed method are very satisfactory and achieved 91.5% over all accuracy.","PeriodicalId":260701,"journal":{"name":"2017 Ninth International Conference on Advanced Computational Intelligence (ICACI)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"Pashto spoken digits recognition using spectral and prosodic based feature extraction\",\"authors\":\"Shibli Nisar, Ibrahim Shahzad, Muhammad Adnan Khan, Muhammad Tariq\",\"doi\":\"10.1109/ICACI.2017.7974488\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic spoken digit recognition is one of the important areas in speech recognition. Local language spoken digits recognition is the next stage in this technological advancement. This paper presents a new approach for Pashto digits recognition using spectral and prosodic based feature extraction. Very little or almost no work has been done in Pashto spoken digit recognition. Thats why no standard Pashto digit corpus was available online. A database of 150 native speakers from 0 (sefor) to 9 (naha) including 75 males and 75 females is developed. Creation of corpus is one of the main contributions of this work. To the best of authors knowledge support vector machine (SVM) is used for the first time in Pashto digits classification and compared with K-nearest neighbor (KNN) classifier. Out of 150, 110 speakers feature set are used for training purposes and the remaining 40 speakers feature set are used for testing. The results obtained from the proposed method are very satisfactory and achieved 91.5% over all accuracy.\",\"PeriodicalId\":260701,\"journal\":{\"name\":\"2017 Ninth International Conference on Advanced Computational Intelligence (ICACI)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Ninth International Conference on Advanced Computational Intelligence (ICACI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACI.2017.7974488\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Ninth International Conference on Advanced Computational Intelligence (ICACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACI.2017.7974488","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Pashto spoken digits recognition using spectral and prosodic based feature extraction
Automatic spoken digit recognition is one of the important areas in speech recognition. Local language spoken digits recognition is the next stage in this technological advancement. This paper presents a new approach for Pashto digits recognition using spectral and prosodic based feature extraction. Very little or almost no work has been done in Pashto spoken digit recognition. Thats why no standard Pashto digit corpus was available online. A database of 150 native speakers from 0 (sefor) to 9 (naha) including 75 males and 75 females is developed. Creation of corpus is one of the main contributions of this work. To the best of authors knowledge support vector machine (SVM) is used for the first time in Pashto digits classification and compared with K-nearest neighbor (KNN) classifier. Out of 150, 110 speakers feature set are used for training purposes and the remaining 40 speakers feature set are used for testing. The results obtained from the proposed method are very satisfactory and achieved 91.5% over all accuracy.