{"title":"Deep Convolution Neural Network Based Speech Recognition for Chhattisgarhi","authors":"N. Londhe, G. B. Kshirsagar, Hitesh Tekchandani","doi":"10.1109/SPIN.2018.8474064","DOIUrl":null,"url":null,"abstract":"The existing ASR for Chhattisgarhi using conventional machine learning technique was implemented for speaker dependent speech recognition. However, the conventional machine learning based speech recognition is incapable to handle the spectral variations as well as the spectral correlation of acoustic signals. Therefore, to overcome the aforementioned limitations, authors have implemented the deep convolution neural network (DCNN) based ASR for Chhattisgarhi dialect. Unlike other deep learning models, DCNN can efficiently handle the spectral variations and spectral correlation of speech signal with the less computational burden. The experiment of isolated Chhattisgarhi word recognition was implemented on self-recorded dataset acquired from 150 subjects from various geographical parts of Chhattisgarh state. The implemented algorithm is promisingly achieving 99.49% of accuracy for isolated word recognition. The different performance paraments are presented to validate the performed experiment.","PeriodicalId":184596,"journal":{"name":"2018 5th International Conference on Signal Processing and Integrated Networks (SPIN)","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 5th International Conference on Signal Processing and Integrated Networks (SPIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPIN.2018.8474064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
The existing ASR for Chhattisgarhi using conventional machine learning technique was implemented for speaker dependent speech recognition. However, the conventional machine learning based speech recognition is incapable to handle the spectral variations as well as the spectral correlation of acoustic signals. Therefore, to overcome the aforementioned limitations, authors have implemented the deep convolution neural network (DCNN) based ASR for Chhattisgarhi dialect. Unlike other deep learning models, DCNN can efficiently handle the spectral variations and spectral correlation of speech signal with the less computational burden. The experiment of isolated Chhattisgarhi word recognition was implemented on self-recorded dataset acquired from 150 subjects from various geographical parts of Chhattisgarh state. The implemented algorithm is promisingly achieving 99.49% of accuracy for isolated word recognition. The different performance paraments are presented to validate the performed experiment.